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mm%] mmm 

[#fFff#©«M • 
.[fNtSl] KTffl (a) (b) tc^fr^ y^IEJ!!fr<D&Sr£:S:# 

(a) SH»©SH^J#-^1 iCjjrt, 7^;i#^l- 3 7 8t^$fl57^;i 

(b) LT^^n^^&^^l-SJSffiSr^L' 
, (a) ©7 $;&©-•»#*£, fim^L 'jKU^^K. 

[ff#«3] #T©.(c)'*fctt (d) E*:r£a£K^>&fc 5' 

(c) @H^J*©|2W# 2 tC^t\ 1 - 1 2 9 8-e^£*l6t£SIB#!fr 

■(d) 7 ^^b^'^^&SRi: LT/r*7n S> 
S/KUX^ F&n- KU (c) ©«^6K#|©^istffc^ gm^L < litfflD 

>©^«4^«*Lfe,^KlE«lttft. 

[M^«7] f|#«6ffi«©^R«ilffi[«J:»J«*Lfca^. 

[»#£8-J W*S2X3fH«©it^©5§^?:WtS^i:.lCj:y, 
ri^ : >©4^«*««ffiTL/fc^fClKSI«[»S:^l!lt-S*iS. 

[#&0J3©i¥»M] • 

[0 0 0 1] 

[»rc©M*68«r^m 
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[0 0 0 2] 
[«£*©««] 

& U £ {C # £ & & * M IS £ & £ « "C'&'S . ' * 7 x >f > 3f -tf- y * y gf 

camp* & $ -e $ * }c £ u , n*f^ffl & tfmmmm © /cit^M & * t * * 
#©s*§& if *Mtc» < t>K ±& l fcrnzmmmx* it^m * as fcaic- 7 x" 

[0 0 0 3] 

% y x >f > 1/ x a - k - £ #■ s £ ft (c % y x >f > © £ £ A 31 » fc -* s 
BWT*> ^^^^^^^©^^(C^^t-^ite^^^M^^T^feo. 
01 (Advances in Botanical Research, Vol.30, Academic Press (1999) pl49 

«t»j»«) (c n-k-M»{cfc^s^7x>r>^^©SJ&^^-r =. ®nc 

$©V-f ^-&S5&&> ffl-eriaVf . -HI © 2® BKa*S*lS8KC, = 5 r'^> h 
S/^fr<3 7-^^;u.^-9->^>, ft^n5 >SrjgfiLT*7x'-Y y%$if&?Z> 
6 ft'Tfcy, .d©iKKrtt3-t-M«[«jK:feWS*7x^>^ 
/*©'££&»•??& S. :0A7i>f> ©£M£-&$«©#¥li 3 OT© N 
■ f - Mt SIS T & »J , 3 ft e> © N - ;* =f- fribR fS tt s - 7 t 1 y to * f - * - > ft # 

gflLt*7xOS^j*tsa»' (H.l©3g|) t;#£LTV^#\ 3© 

a»©*#tt*$35fe4»©T*tt&^o v -*^;b*-tf->^>£-£j&-rs*!0©*f- 
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(®^Pfi## WO 97/35960 ) . bfr U fll2apg, £ 3 «Pg©*^Mfc 

A y<D £ H# * 5 tc # 7 x >f > ic W-^t 5 #3R©3t£-?- K 

[0 0 04] 
'.tM&jBfcfSfcfc©^ 

& * -5 £ * BMIT? & 6 esb ic, SiMi©ii£^0M & 8HW 
x#7*ni>©££/£ffW1fiia*iS. # 7 x>f :/©££&&»££ v^'. 

D^>©^k8^«rmsn«i:, 7 x>f >©££#* *£j|i1fi[3*is.. ±11 

#'ffl&Wt«fc«)K:v -rtf?ns>k*7x>f >©W#©£-£j*&ffl^ 
^*«*©itert«i'rs*K:tt*t«:3S«i*<»*. ft3M©*:f;i, 
'ftSfStcH-^U *7x>f ^©ftjKW&'^^icK^fSSjR&n- K"f3»£ 
LT*©38«&ffl1S[*r<5i:, ' *7x>f ^.©^^flliastCS^*^- 

A&n*. ioTx^^n^^j^ArxV >4:SflS(©^S?Stt&^LTv^5*S: 
■r^7'□^>^»©ite^ ^ •$:«Lfe«BH©»«^^v^i:# 

[0 0 0 5] 
[M©«©M] 

#3818 Mu IH^©ffi^'J#^2 {C^t, ■_£*#-£ 1 - 1 2 9 8T-^3*lSi&S 
S^&ftSitSr^ait'f ir-b-T^tr'* (Coffea arabica) fl^©7^ 

*^;t/ftip&&ttjKtSMT?»<6. 12^Jp©IS^I##2lB*©S»IB^J 

3 ffiiE# 2 0 01-3061410 
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o 

[0 0 0 6] 

$M & jg 31" i £ £ 6 . #388B J: y $ ft 5 *$©&aSK#| 5: £ * 5 it 
m 2 tl #tt & * -T .5 t) © £ + 3 Z. £ «f IB ■V & »J , E li * © £ ? 

g^L<tt#sn$tiAcae^i:«, sH^j#-^2(c^i-maie^cfevNT i 01 

#J#-5f 1 iC^-Tite^i: 9 0 %Jgmt, L < It 9 5 %JS*±. XtC^* L < li 9 
9 %JSLfc©ffiW:f£ -^©*at&itfc^&, X h U > 

ft* fc, -7 - ^f LTf^^-n 5 >&4^l*t*,. 

. u$y&jmm<?>ttm:*i* sjku^^ K&n- N- sissy, ,#?w©igHi*j' 

[0 0 0 7] 

35IC*»»«, SB^m©IS^J#-^ 1 75 1 - 3 7 

S.T^^KBB^^iiS^fcSr^atfrS, =r-t:-T^tr* (Coffea arabica 
) S^©-r^^D5>^S^©^y^^KT'fe5o ffi>J##lfc3<f stf'J* 

^1-75 smmmzis^T 1 orajsiT. #*i/<tt7.«grF, jgtc^*L<n3 

' 9 0 %J£<±, §^<!i9 5 %Jg*±, . Kid*?* L<tt9 9 %JK±©ffiRfttS:-^ 

*s. *©«*&# U Kfc, 7-^5 L ;i/^^>f->$:SS^ tT7 1 ^yD5 
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A B!#I*©S£!I## 3 , gB#I#-&5, S^J##7tc^1-JKy<^.K 
tt, 3-t-r5lf* (Coffea arabica) frfeflffctU U#I*©K?!f#-Sf 1 ©7 

[0 0 0 8] 

o *»9!©"r'*^n^>^»3Rare-?-tt, ±3*l/fcfiftK::3-fc:-79fcr#©f 1 

£^©?8^£OTJ1-3^i:lCJ: U, ffi^f{CfeVNT^^n ^ >©^^§:fflI0Jb 
T\ .ffi&tCfc^T-rtf^n'S >.fc#7x'>f >©£.*fcffiTS-fr*3fc#BrtB-C& 

tl — Tvtf^J (Coffea arabica) , a — b: — jj % ~7 -ft — *y (Coffea canephora) 
, (Coffea liber ica ) ^a-t-f^i^W (Coffea d 

ewevrei ) fffln- k -MUWW 
[0 0 0 9] 

^^©HfefCfc^T, *|§B^(D»^©|gJ|S:«If[|t-§*{Cj:y,.^^-^a 

X'ZZo. ry^-zyzmte^tte. &sjtexi:fl^uT^*DNA. ©«¥£&? 
**©rtNAfcfflffitt«:W'tsfe«>jc, ry+tyxmfc^immm 

[0010] 1 

■£©&£*>, 3tfi-?-»3®aiffi!l©*l5Sjyv^<o^&-nTV^S. swite^©* 
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. tit. ^*^RNA V\ 5 Double-stranded RNA interference (RNAi) -££#31 
fc?%m£>Wlffl\ZM®T*$>ZZ.t:t>K jS^-fffctlT^T^S (Chiou-Fen Chuang 
et al. PNAS (2000) vo 1.97, 4985-4990 ) „ Zl^itRNA «ffl#I#MlftfCit<|£^3g 

. d©«fc-*flRNA &E^S#&#RNAHgT?&»J, *3SiStfi»A^i/3 9J?a 

§THSo RNA'ijgfcJ: 'J Ife? Wltll'JS tlSlittl ^ fit ^4 WA\ 

[0 0 1 1] 

> 7-*^;i/^>^>£SfC£LT^;r7 r n^ 

W0 97/35960 ) £&K#»tJ!©*e^ fcttftK^A"* S 3 h it. #(C 

■^S&tf&siiSfcfts. 7-^5 L ;i/^if>^>$:^i-'g)0*©vStt«ii$ti 

[0 0 12] 

*e.*»J, 09;UipBI121*tfW&*iS#; * ft JC K£ S *i £ *j©T*H:fc^, 
©J:e>ft^#-£, WAtfrVny^^U ABIC*ALT, #;i/XXli#rtfi 

6 fflfE4$ 2001-3 061410 
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2000-245485 icfcV^T, 3- £7**0 A^x U ? 
*7 7^X>XEHA101fC«£it3 ' illCj: U, K^»*T?JgJMK&'Z?£&#i£ 
&«§LTfe»J, <f#JW2000-245485 iZtotiL £ 

[0013] 

[Hd&M] 
' (PCR JCk-Sifli). 

-fffir^>'x^l/-h •■ ^"U ifjl 9 l/Tt? K (Forward 75^fV-, GGITGY 
DSIDSIGGICCIAAYAC; Reverse 75^f7-, ARIYKIYYRTRRAAISWICCIGG ) TC 
SI (Kato et al 2000, GenBank accession no. AB031280 ) t 2-D(Dt/U 4 ^± 
Xi-omm^ViZyrtVn (Z99708&tfAC008153) ©HT?fiH£SftfciB«lca£ 
:5V*T£j*L£. ,Zfte)©t'J =T5t^.-l/^ K« v GC(A/S)(A/S)GPNT£;, PGSF(H 
/Y)(G/K)(R/N)LF H^o7* JWMm^. 3rt-v75!^ 
# (Coffea arabica) ©cDNA&tf±fB© . 1 II©75 >f &-£tr25 /i 1 ©£ffc*g 
JKTO^frTWR'.&ffbfc. HP*., 94TCT.l#IBIiRJ&'l/fcf&, 94°C 
•CaO^ISfCDjgffiiRjj:, 52°CT*30W©7-- U >4A 72°CT* 1 £IB!©#£jRj8;© 
' »y>gU>30W *;WtV\ JSfC72tt' 7#|H©£*|#;R£j£'&f?^fcVO*# 
t% PCR Sffofc. J**S3*lfc*il270 ftSftOcdNABrtffcffl^T, cDNA5-f.75 

[0014]. 

(cDNA5 A 75 U - 0)4191 £ <fc tf S iftcDNA© 7. V - - > 40 
□ -tl- (Coffea arabica) ©3gV^fr ib^RNA fcJtffflU ol igo-dT;2j 5 A ( 
Pharmacia ) JC«fc *J "RNAJcftSI Lfed ZAPII cDNA synthesis kit (Stratagene) 
&fflV*r«RNAfr*>cDNA&-&J5ftU AZAPII X^^-AiAU 77-^7>f^7 
U - fcffsSL fc. ±3B©*Bli 3 *tfc Wt# £ 70 - 7lCcDNA5.-f 75 y - & * £ y 

tt 4 o © £ u fe 47 n - > ic ft JB ■* 3 z t tfi W <b £ & o .. 
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[0 0 15] 

-eti-etKBfwe&y, t^t^±&cmkK.^^'n^y# k #6, # 35 

*lfccDNA©&gi!?!J&v IE«©IB^J#^2MBI2D(C^to ^a^©^ ■ 
-V>U-7 r 'f^^7l/-Attm»#3 2- 1 1 6.8T?»-y,.*©««lc'J:y 
n - F $ ft 5 » 7 5 ) BfcSJfll S , SH#Iii©SB»# 1 JC^1% * fc» ? n - > 

„ siie^t-y>y-fi>^7i/-Aiiiiff 14-1 1 7 ltfey 

^OT2BlCBto MtSfc£*'©#-V>y -Tjy>f7 1/ 4 4. . 

-i2oit^»j> -e<z)«{cj:yn- K^fis^r^ mnmo) 

ffiW^f SKBto #n-S/#3 5fcoV*T#&*lfccDNA©ifei£Bi*J£, 

IB#l£©'IB#l## 8 fttfB 2 C \zBt, ^Mfc^©*- U -r< > ^ 1/ 

-Aiiss#-t4 5.- 1 1.6 3 v&v\ -e©«{cj:yn-F$tis«r^/ 

IMB#f&'* K#J£©ffi#l#* 7 IC**.' RT, *B->#4 5&MXMT1 t, 9 U 

-yt l s-MTLii:, ' Vu-yt 6 &mtl2£: v vu-yt 3 5 £MTL3t, ififi^S: 
. [o o i 6] ' . 

MXMTl , MTL1, MTL2, &m^U1$n- &7 1 JWlMWT? *y V*. 
[0017] 

■ (gst w&zynvRtoQfiL) 

MTL1 1) , : MTL2 (^D-># 6) , MTL3 (^n-># 3 5) fi«fc 

tfMXMTl (^n->#4 5) ©*-^>U -r>r >^*7l/-A$:^tfM^5:PCR( 

8 ffiE# 2 0 0 1-30614 
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polymerase chain reaction) 7? fgiteL, £>-&pGEX 4T-2 # - (Pharmaci 

a ) /vasf n-b>^b; *mm (jmos.) ^iiiiLfe. #e>*ifc*»a 

^r^tfi/U^^^-^LB^MT'^*^, OD600' #0.5 iCJtUfe^ liS^ 
lmM IC&S ioiCIPTG (isopropyl thio- £ -D-galactoside) $fp{C16. 

ffl^TGST (glutathione S-transf erase ) |^^!>/^gi:bti||bfe G ' # 
>A°^SaS«Bradford^(Cj; tjftlfc. &GST * > A * R (500 ng) £ 
SDS-PAGE ( K^S/^/flSSf h A>K U 7* 'M7$ K^;i/*teifr) T'^i 

' 2ttMTL3ft*CDil|-&gafCS:,' U- > 3 &MXMT1 tft5fc©i&£g£IC?: ^tl^ft^ 
[0018]. 

(is ^ o v h ^7 7 c i 5 iisttcig) 

WMV U*? btf1?7 (TLC ) tG«fc»J,-»jgigtt©ilBj£&fTofc. it«e> (P ]an 
t Physiol., 1996, 98, 629-636 ) ©#&ICS^£, .SaRjSffi©^ fcffo fe = 
*ft6toi 100/il ©Mffr^ (100 mM Tris-HCl (pH7.5) ; ;200/iM ©«R ('* 

» V 4/tM 14 C fflfiS -7tV *J)\/*^ik-y. ■ 200 /iM MgCl 2 , 200 ng GST 

*Jt) &27iCT?2 ipsifrl, i mi©^nn^;i/ATttmu ?on 

tc a 5fi 1 ©50% TLC T'MP^Lfc : WWt : 

n-^y-;i/ = 2 : 1 : 4 , v/v/v ) „ Tif<R»*f^« "(Fuj iBAS2000) T? 

■nxmi/tri-frZikmLtc. *vy?y a ) 7 -t^n^-v-y^y (7-kx) 

, ^^^P^> (Tb) v 7<7*t>f>' (Px) , -x^7*U> (Tp) &IItb 
T, MTL2, MTL3MMXMT1 ffi#©^£g&H©»iR?Stt&ffl£L 15 
JCa**-. 0 5 iyMXMTl fi#©»£gaj?W:;- jl/Jp*>f>?:iIi:l 
T'5 L *^n^>S:.-&fiai-5^«;?Stt«:^L/TV^ D - MXMT1 fi#©IB[£gBiltt ' 
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[0019]' 

il»?n7f^7 7 CHPLC).tcJ:'y, MXMT1 ©K£gBK©«^?Sffi©*lfr 
Si:, «SJSiCj:S^Mf54»©|Bl^Tofc 0 100^1 ©£« (100m 
M Tr i s-HC 1 (pH7 .5) ; 200#M ©SS (7-* *Vy* >, 
■> v ^;fr:/tU» , 50^M J SS 200 MgCl 2 , 200n 

g GSTi&-£*>^R) &271CT?2 BUS^f" 1 ml©?Dn*;i/A 

T?tfffiU *nn>Ol/AJiS:pJKU KA^ 3>-fe > h n 

n*;W>£ttfbfeo 50/xi ©i2X r^h-hij;wc^m uvfcffia&fltafc'- 

HPLC (Shodex RSpak DS-613 column) T»£ifbfe. MM»12% ©7i?h-h 

[0 0 2 0 ] , . 

>©f§3l£, S6(3t. 0 6 Att, MXMT1 ©it£gs?i£, s-7f;^ 
;* > t 7 - * ^Vl/^r+f- yf y Zmntfcfo 2 ^ *©£)£»£ HPLCT'j$? 

Art h -e* So 0 6 b it, gat LT^tf^n ^ y zwiczMffi b tz* 

^-hT'&So 06 C«, ^>hn-i^b-C, MXMfl ©ingest 

£±#), *©B£«&HPLC-eJSMfr L £f V -. h T* & 5 . 0 6 Dtt, gfp £ L T, 

ffiLfc^-Mf&.S-. El 6 Eli, S-Tr^Wft-^?.-^^* 
>f->£MXMTl ©«l#SaSti:Sf5'S*feSfB*fc. r*7'n ^ >£»bTHP 
LC^jStfLfe^-hTf^S. 0 6 AT^aj^ tlfcS^M#©^-^Mtt^ 

&'^ur%.i^©fcr-^p^«s*i&^ofe*^e>, kxmti ©stages© 
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[0 0 2 2] ■ .. . ' 

m^m . . • 

<l 2 0>Mcz>£l3i: n-t-M»Ox^^n^ >^M*5KU^^ KS: 

< 1 6 0 >IM<7)i£ : 8 . 
<2 1 0>gH?iJ#t : 1 ' • 

<2 11>WJ05§ : 3 7 8 
<.2 1 2>mW<Z>M :7^JM 
<2 1 3>|3^:Caffea arabica 

<4 0 Officii 

MELQEVLHMN EGEGDTSYAK NASYNLALAK VKPFLEQCIR ELLRANLPNI NKCIKVADLG 60 

CASGPNTLLT VRDIVQSIDK VGQEEKNELE RPTIQIFLND LFQNDFNSVF KLLPSFYRKL 120 \ . 

EKENGRKIGS CLISAMPGSF YGRLFPEESM HFLHSCYSVH WLSQVPSGLV IELGIGANKG 180 ' .<:-.■) 

SIYSSKGCRP PVQKAYLDQF TKDFTTFLRI HSKELFSRGR MLLTCICKVD EFDEPNPLDL 240 

LDMAINDLIV EGLLEEEKLD SFNIPFFTPS AEEVKCIVEE EGSCEILYLE TFKAHYDAAF 300 - ; , 

SIDDDYPVRS HEQIKAEYVA SLIRSVYEPI LASHFGEAIM PDLFHRLAKH AAKVLHMGKG 360 . 1 

CYNNLIISLA KKPEKSDV 378 



1 2 ffiHE#2 0 



0 1 - 3 0 6 1 4 1 0 
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. < 2 1 0 >U«^ : 2 
<2 1 1 >mn<D&Z' 12 9 8 
<2 1 2 >@E#I©M : M ■ . 
.< 2 1 3 >Wf& : Caffea arabica 



< 4 o o >mn 












AGCAGTCGCA 


ATTCGATTGT 


CCTGCATATG 


AATGGAGCTC 


CAAGAAGTCC 


TGCATATGAA 


60 


TGAAGGTGAA 


GGCGATACAA 


GCTACGCCAA 


GAATGCATCC 


TAGAATCTGG 


CTCTTGCCAA 


120 


GGTGAAACCT 


TTCCTTGAAC 


AATGCATACG 


AGAATTGTTG 


CGGGCCAACT 


TGCCCAACAT 


180 


CAACAAGTGC 


i rnm iii nT"rn 

ATTAAAGTTG 


CGGATTTGGG 


ATGCGCTTCT 


GGACCAAACA 


CACTT.TTAAC 


240 


AGTGCGGGAC 


a rr*rn m /""i y^i a a a 

ATTGTGCAAA 


GTATTGACAA 


AGTTGGCCAG 


GAAGAGAAGA 


ATGAATTAGA 


300 


ACGTCCCACC 


ATTCAGATTT 


TTCTGAATGA 


TCTTTTCCAA 


AATGATTTCA 


ATTCGGTTTT 


360. 


CAAGTTGCTG 


CCAAGCTTCT 


ACGGCAAACT 


CGAGAAAGAA 


AATGGACGCA 


AGATAGGATC 


420 


GTGCCTAATA 


AGCGCAATGC 


CTGGCTCTTT 


CTACGGCAGA 


CTCTTCCCCG 


AGGAGTCCAT 


480 


GCATTTTTTG 


CACTCTTGTT 


ACAGTGTTCA 


TTGGTTATCT 


CAGGTTCCCA 


GCGGTTTGGT 


540 

■i' 


GATTGAATTG 


GGGATTGGTG 


CAAACAAAGG 


GAGTATTTAC 


TCTTCCAAAG 


GATGTCGTCC 


600 


GCCCGTGGAG 


AAGGCATATT 


TGGATCAATT 


TACGAAAGAT 


TTTACCACAT 


TTCTAAGGAT 


660 


TCATTCGAAA 


GAGTTGTTTT 


CACGTGGCCG 


AATGCTCCTT 


ACCTGCATTT 


GTAAAGTAGA 


720 i 


TGAATTCGAC 


GAACCGAATC 


CCCTAGACTT 


ACTTGACATG 


GCAATAAACG 


ACTTGATTGT 


780 


TGAGGGACTT 


CTGGAGGAAG 


AAAAATTGGA 


TAGTTTCAAT 


ATTCCATTCT 


TTACACCTTC 


'840 


AGCAGAAGAA 


GTAAAGTGCA 


TAGTTGAGGA 


GGAAGGTTCT. TGCGAAATTT 


TATATCTGGA 


900 


GACTTTTAAG 


GCCCATTATG 


ATGCTGCCTT 


CTCTATTGAT 


GATGATTACC 


CAGTAAGATC 


960 


CCATGAACAA 


ATTMAGCAG 


AGTATGTGGC 


ATCATTAATT 


AGATCAGTTT 


ACGAACCCAT 


1020 


CCTCGCAAGT 


CATTTTGGAG 


AAGCTATTAT 


GCCTGACTTA 


TTCCACAGGC 


TTGCGAAGCA 


1080 


TGCAGCAAAG 


GTTCTCCACA 


TGGGCAAAGG 


CTGCTATAAT 


AATCTTATCA 


TTTCTCTCGC 


1140 


CAAAAAGCCA 


GAGAAGTCAG 


ACGTGTAAAA 


GTTTGTTTTT 


AGTTGGTTTT 


TGTGCCGTTG 


1200 


GGGGTCTTTC 


GGGTATTGTC 


GTTTTGTATT 


CGTAATAAAA 


GTGATGTGCA 


AGAATAAGAT 


1260 


ATTTAGTAGA 


ATATTTTCAT 


AAAAAAAAAA 


AAAAAAAA . 
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MELQEVLHMN GGEGEASYAK NSSFNQLVLA KVKPVLEQCV RELLRANLPN INKCIKVADL ,60 

GCASGPNTLL TVWDTVQSTD KVKQEMKNEL ERPTIQVFLT DLFQNDFNSV FMLLPSFYRK 120 

LEKENGRKIG SCLTAAMPGS FHGRLFPEES. MHFLHSSYSL QFLSQVPSGL VTELGITANK 180 

RSIYSSKASP PPVQKAYLDQ FTKDFTTFLR MRSEELLSRG RMLLTCICKG DECDGPNTMD 240 

LLEMAINDLV AEGRLGEEKL DSFNVPIYTA SVEEVKCMVE EEGSFEILYL QTFKLRYDAG 300. 

.FSIDDDCQVR SHSPVYSDEH ARAAHVASLI RSVYEPILAS HFGEAI IPDI FHREATNAAK 360 

VIRLGKGFYN NLI ISLAKKP EKSDI . . 385 
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GTCCTGCATA TGAATGGAGC TCCAAGAAGT CCTGCATATG AATGGAGGCG AAGGCGAAGC 60 

AAGCTACGCC AAGAATTCAT CCTTCAATCA ACTGGTTCTC GCCAAGGTGA AACCTGTCCT 120 

TGAACAATGC GTACGGGAAT TGTTGCGGGC CAACTTGCCC AACATCAACA AGTGCATTAA 180 

AGTTGCAGAT TTGGGATGCG CTTCCGGACC AAACACACTT TTAACCGTTT GGGACACTGT 240 . 

ACAAAGTATT GACAAAGTTA AGCAAGAAAT GAAGAATGAA TTAGAACGTC CCACCATTCA 300 

GGTTTTTCTG ACTGATCTTT TCCAAAATGA TTTCAATTCG GTTTTCATGC TGCTGCCAAG 360 

CTTCTACCGC AAACTTGAGA AAGAAAATGG ACGCAAAATA GGATCGTGCC TAATAGCCGC 420 

AATGCCTGGC TCTTTCCACG GCAGACTCTT CGCCGAGGAG TCCATGCATT TTTTACACTC 480. 

TTCTTACAGT CTTCAGTTTT TATCCGAGGT TCCCAGCGGT TTGGTGACTG AATTGGGGAT. 540 

■ CACTGCGAAC AAAAGGAGCA TTTACTCTTC CAAAGCAAGT CCTCCGCCCG TCCAGAAGGC 600 

ATATTTGGAT CAATTTACGA AAGATTTTAC CACATTTTTA AGGATGCGTT CGGAAGAGTT 660 

GCTTTCACGT GGCCGAATGC; TCCTTACTTG CATTTGTAAA GGAGATGAAT GCGACGGCCC 720 

GAATACCATG GACTTACTTG AGATGGCAAT A A ACGACTTG , GTTGCTG AGG GACGTCTGGG . 780 

GGAAGAAAAA TTGGACAGTT TCAATGTTCC AATCTATACA GCTTCAGTAG AAGAAGTAAA 840 

GTGCATGGTT GAGGAGGAAG GTTCTTTTGA AATTTTATAC TTGCAGACTT TTAAGCTCCG 900 
TTATGATGGT GGCTTCTCTA TTGATGATGA TTGCCAAGTA AGATCCCATT CCCCAGTATA 960 

CAGCGATGAA CATGCTAGAG CAGCGCATGT GGCATCATTA ATTAGATCAG TTTACGAACC 1020 

CATCCTAGCA AGTCATTTTG GAGAAGCTAT TATACCTGAC ATATTCCACA GGTTTGCGAC 1080 

GAATGCAGCA AAGGTTATCC GCTTGGGCAA AGGCTTCTAT AATAATCTTA TCATTTCTCT 1140 

TGCCAAAAAA CCAGAGAAGT CAGACATATA AAAGCTTGTT TTTAGTTGGT TTTTGTGTTA 1200 

TGGGTTGTTT TCTGATACGG GGAAAGGATT CAGTGCGGTT GGGGTTCTAT GCGAGTATTG 1260 

TACTTTTTAT ATTATTAGTT GGTGTATAAT TATTATGTTA CATTGTTATA TTCGTAATAA 1320 

AAGTGACGTA CAAAAATAAA ATATTTTCAT AAAAAAAAAA 1360 
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MELQEVLHMN GGEGDASYAK NSSFNQLVLA KVKPVLEQCV GELLRANLPN INKCIKVADL 60 
GCASGPNTLL TVRDIVQSID KVRQEMKNEL ERPTIQVFLT DLFQNDFNSV FMLLPSFYRK 120 
LEKENGRKIG. SCLIAAMPGS FHGRLFPEES MHFLHSSYSL QFLSQVPSGL VTELGITANK 180 
RSIYSSKASP PPVQKAYLDQ FTKDFTTFLR IRSEELLSRG RMLLTCICKG DEFDGPNTMD 240 
LLEMAINDLV VEGHLEEEKL DSFNVPIYAA SVEELKCIVE EEGSFEILYL ETFKLRYDAG 300 
FSIDDDCQVR SHSPEYSDEH AR A AHV ASLL RSVYEP I LAN HFGEAIIPDI FHRFATNAAK 360 
VIRLGKGFYN NLIISLAKKP EKSDI 385 
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TTTAGCAGTC 


CCAATTCGAT 


TTATGTACAA 


GTCCTGCATA 


TGAATGGAGC 


TCCAAGAAGT 


60 


GCTGCATATG 


AATGGAGGCG 


AAGGCGATGC 


AAGCTACGCC 


AAGAATTCAT 


CCTTCAATCA 


120 


ACTGGTTCTC 


GCCAAGGTGA. 


AACCTGTCCT 


TGAACAATGC 


GTAGGGGAAT 


TGTTGCGGGC 


180 


CAACTTGCCC 


AACATCAACA 


AGTGCATTAA 


AGTTGCGGAT 


TTGGGATGCG 


CTTCCGGACC 


240 


AAACACACTT 


TTAACAGTTC 


GGGACATTGT 


ACAAAGTATT 


GACAAAGTTA 


GGCAAGAAAT 


300 


GAAGAATGAA 


TTAGAACGTC 


CCACCATTCA 


GGTTTTTCTG 


ACTGATCTTT 


TCCAAAATGA 


360 


TTTCAATTCG 


GTTTTCATGT 


TGCTGCCAAG 


TTTCTACCGC 


AAACTTGAGA 


AAGAAAATGG 


420 


ACGCAAGATA 


GGATCGTGCC 


TAATAGCCGC 


AATGCCTGGC 


TCTTTCCACG 


GCAGACTCTT 


480 


CCCCGAGGAG 


TCAATGCATT 


TTTTACACTC 


TTCTTACAGT 


CTTCAATTTT 


TATCCCAGGT 


. 540 


TCCCAGCGGT 


TTGGTGACTG 


AATTGGGGAT 


CACTGCGAAC 


AAAAGGAGCA 


TTTACTCTTC 


600 


CAAAGCAAGT 


CCTCCGCCCG 


TCCAGAAGGC 


ATATTTGGAT 


CAATTTACGA 


AAGATTTTAC 


660 


CACATTTTTA 


AGGATTCGTT 


CGGAAGAGTT 


GCTTTCACGC 


GGCCGAATGC 


TCCTTACTTG 


720 


CATTTGCAAA 


GGAGATGAAT 


TCGACGGCGC 


GAATACCATG 


GACTTACTTG 


AGATGGCAAT 


780 


AAACGACTTG 


GTTGTTGAGG 


GACATCTGGA 


GGAAGAAAAA 


TTGGACAGTT 


TCAATGTTCC 


840 


AATCTATGCA 


GCTTCAGTAG 


AAGAATTAAA 


GTGCATAGTT 


GAGGAGGAAG 


GTTCTTTTGA 


900 


AATTTTGTAC 


TTGGAGACTT 


TTAAGGTCCG 


TTATGATGCT 


GGCTTCTCTA 


TTGATGATGA 


960 


TTGCCAAGTA 


AGATCCCATT 


-CCCCAGAATA 


CAGCGATGAA 


CATGCTAGAG 


CAGCGCATGT 


1020 


GGCATCATTA 


CTTAGATCAG 


TTTACGAACC 


CATCCTCGCA 


AATCATTTTG 


GAGAAGCTAT 


1080 


TATACCTGAC 


ATATTCCACA 


GGTTTGCGAC 


GAATGCAGCA 


AAGGTTATCC 


GCTTGGGCAA 


1140 


AGGCTTCTAT 


AATAATCTTA 


TCATTTCTGT 


TGCCAAAAAA 


CCAGAGAAGT 


CAGACATATA 


1200 


AAAGCTTGTT 


TATAGTTGGT 


TTTTGTGCTA 


TGGTTTGTTT 


TCTGATACGG 


GGAAAGGATT 


1260 


TAGTGCGGTT 


GGGGTTCAAA 


AAAAAAAAAA 


AAAAAAAAAA 


AAAA. 
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MELQEVLRMN GGEGDTSYAK NSAYNQLVLA KVKPVLEQCV RELLRANLPN. INKCIKVADL 60 

GCASGPNTLL TVRDIVQSID KVGQEKKNEL ERPTIQIFLN DLFPNDFNSV FKLLPSFYRK 120 

LEKENGRKIG SCLIGAMPGS FYSRLFPEES MHFLHSCYCL QWLSQVPSGL VTELGISTNK 180 

GSIYSSKASR LPVQKAYLDQ FTKDFTTFLR IHSEELFSHG RMLLTCICKG VELDARNAID 240 

LLEMAINDLV VEGHLEEEKL DSFNLPVYIP SAEEVKCIVE EEGSFEILYL ETFKVLYDAG 300 

FSIDDEHIKA EYVASSVRAV YEPILASHFG EAIIPDIFHR FAKHAAKVLP LGKGFYNNLf 360 

ISLAKKPEKS DV ' 372 
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CTTTGGCAGT CCCAATTTGA TTTATGTACA AGTCCTGCAT' ATGAATGGAG CTCCAAGAAG 60 

TCCTGCGGAT GAATGGAGGC GAAGGCGATA CAAGCTACGC CAAGA'ATTCA GCCTACAATC .120 

AACTGGTTCT CGCCAAGGTG AAACCTGTCC TTGAACAATG CGTACGGGAA TTGTTGCGGG 180. 

CCAACTTGCC CAACATCAAC AAGTGCATTA AAGTTGCGGA TTTGGGATGC GCTTCTGGAC 240 

CAAACACACT TTTAACAGTT CGGGACATTG TCCAAAGTAT TGACAAAGTT GGCCAGGAAA 300 

AGAAGAATGA ATTAGAACGT CCCACCATTC AGATTTTTCT GAATGATCTT TTCCCAAATG 360 . . 

ATTTCAATTC GGTTTTCAAG TTGCTGCCAA GCTTCTAGCG CAAACTTGAG AAAGAAAATG 420 

GACGCAAAAT AGGATCGTGC CTAATAGGGG CAATGCCCGG CTCTTTCTAC AGCAGACTCT 480 ' \:i ' i 

TCCCCGAGGA GTCCATGCAT TTTTTACACT GTTGTTACTG TCTTCAATGG TTATCTCAGG . 540 ' , -| 

TTCCTAGCGG TTTGGTGACT GAATTGGGGA TCAGTACGAA CAAAGGGAGC ATTTACTCTT 600 

CCAAAGCAAG TCGTCTGCCC GTCCAGAAGG CATATTTGGA TCAATTTACG AAAGATTTTA 660 : ; f 

CCACATTTCT AAGGATTCAT TCGGAAGAGT TGTTTTCACA TGGCCGAATG CTCCTTACTT 720 J 

GCATTTGTAA AGGAGTTGAA TTAGACGCCC GGAATGCCAT AGACTTACTT GAGATGGCAA .780 

TAAACGACTT GGTTGTTGAG GGACATCTGG AGGAAGAAAA ATTGGATAGT TTCAATCTTC 840 ' 

CAGTCTATAT ACCTTCAGCA GAAGAAGTAA AGTGCATAGT TGAGGAGGAA GGTTCTTTTG 900 

AAATTTTATA CCTGGAGACT TTTAAGGTCC TTTACGATGC TGGCTTCTCT ATTGACGATG 960 

AACATATTAA AGCAGAGTAT GTTGCATCTT CCGTTAGAGC AGTTTACGAA CCCATCCTCG 1020 

CAAGTCATTT TGGAGAAGCT ATTATACCTG ACATATTCCA CAGGTTTGCG AAGCATGCAG 1080 

CAAAGGTTCT CCCCTTGGGC . AAAGGCTTCT ATAATAATCT TATCATTTCT CTCGCCAAAA 1140 ■ ■ 

AGCCAGAGAA GTCAGACGTG TAAAAGTTTG TTTTTGTGTT GGGGAAAGGA ATAAGTGCCG 1200 . 

TTGGGGGTCT TTCGGGTATT, GTGCTTTTTA TATTATATTG TTTTGTATCC GTAATAAAAG 1260 . 

TGGTGTGTAA GAATAAGATA TTTGACATAT ATTATTTTCA AAAAAAAAAA AAAAAA 1316 
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GTCCTGCATA .TGAATGGAGC TCCAAGAACT CCTGCATATG AATGGAGGCG AAGGCGAAGC AAGCTACGCC AAGAATTCAT CCTTCAATCA 90 

ACTGGTTCTC GCCAAGGTGA AACCTCTCCT TGAACAATGC CTACGGGAAT TGTTGCGGGC CAACTTCCCC AACATCAACA ACTGCATTAA 180 

AGTTGCAGAT TTGG6ATGCC CTTCCGGACC AAACACACTT TTAACCGTTT GGGACACTGT ACAAACTATT GACAAAGTTA AGCMGAAAT 770 

GAA6AATGAA TTAGAACG7C CCACCATTCA GGTTTTTCTG ACTGATCTTT TCCAAAATGA TTTCAATTCG CTTTTCATGC TGCTGCCAAG 360 

CTTCTACCGC AAACTTGaGA AAGAAAATGG ACGCAAAATA GGATCGTGCC TAATAGCCGC AATGCCTGGC TCTI TCCACG GCAGACTCTT 450 

CCCCCAGGAG TCaf GCAR TTTTACACTC TTCTTACAGT CTTCAGTTTT TATCCCAGGT TCCCAGCGGT TTGGTGACTG AATTGGGGAT . 548 . 

CACTGCGAAC AAAAGGAGCA TTTACTC7TC CAAAGCAACT CCTCCGCCCG TCCAGAAGGC ATATTTGGAT CAAT7TACGA AAGATTTTAC. 630 

CACATTTTTA AGGATGCGTT CGGAAGAGTT GCTTTCACGT GGCCGAATGC TCCTTACTTG CATTTCTAAA GGAGATGAAT GCGACGGCCC 720 

GAATACCATG GACTTACTTG AGATGGCAAT AAACGACTTG CTTGCTGAG6 GACCTCTGGG GGAAGAAAAA TTGGACAGTT TCAATGTTCC 810 

AATCTATACA GCTTCACTAG AAGAAGTAAA GTGCATGGTT GAGGACGAAG GTTCmTGA AATTTTATAC TTGCACACTT TTMGCTCCG 960 

TTATGATGCT GGCTTCTCTA TTGATGATGA TTGCCAAGTA A6ATCCCATT CCCCACTATA CAGCGATGAA CATGCTAGAG CAGCGCATGT 990 

GGCATCATTA ATTAGATCAG TTTACGAACC CATCCTAGCA AGTCATTTTG GAGAAGCTAT TATACCTGAC ATATTCCACA GGTTTGCGAC 1050 

GAATGCAGCA AAGGTTATCC GCTTGGGCAA AGGCTTCTAT AATAATCTTA TCATTTCTCT TGCCAAAAAA CCAGA6AAGT CAGACATATA 1170 

AAAGCTTGTT TTTAGTTGGT TTTTGTGTTA TGGCTTGTTT TCTGATACGG GGAAAGGATT CAGTGCGGTT GGGGTTCTAT CCGAGTATTG 1260 

TACTTTTTAT ATTATTAGTT GGTGTATAAT TATTATGTTA CATTGTTATA TTCGTAATAA AAGTGACGTA CAAAAATAAA ATATTTTCAT 1350 

AAAAAAAAAA ■ 1360 



B ; 

TTTAGCAGTC CCAATTCGAT TTATGTACAA GTCCTGCATA TGAATGCAGC TCCAAGMGT CCTCCATATG AATGGAGGCG AAGGCGATGC 90 

AAGCTACGCC AAGAATTCAT CCTTCAATCA ACTGGTTCTC GCCAAGGTGA AACCTCTCCT TGAACAATCC GTAGGGGAAT TGTTGCGGGC ISO 

CAACHGCCC AACATCAACA ACTGCATTAA AGTTGCGGAT TTGGGATGCG CTTCCGGACC AAACACACTT TTAACACTTC GGGACATTCT Z70 ' 

ACAAACTATT GACAAAGTTA GGCAAGAAAT GAAGAATGAA TTAGAACGTC CCACCATTCA GGTTTTTCTG ACTGATCTTT TCCAAAATGA 360 

TTTCAATTCG GTTTTCATGT TGCTGCCAAG TTTCTACCGC AAACTTGAGA AAGAAAATGG ACGCAAGATA GGATCCTGCC TAATAGCCGC 450 

AATGCCTGGC TCTTTCCACG GCAGACTCT7 CCCCGAGGAG TCAATGCATT TTTTACACTC TTCTTACAGT CTTCAAT7TT TATCCCAGGT 540 

TCCCAGCGGT TTGCTGACTG AATTGGGGAT CACTGCGAAC AAAAGGAGCA TTTACTCTTC CAAAGCAAGT CCTCCGCCCG TCCAGAAGGC. 630 

ATATTTGGAT CAATTTACGA AAGATTTTAC CACATTTTTA AGGATTCCTT CGGAAGAGTT GCTTTCACGC GGCCGAATGC TCCTTACTTG 720 

CATTTGCAAA GGAGATGAAT TCGACGGCCC GAATACCATG GACTTACTTG AGATGGCAAT AAACGACTTG GTTCTTGAGG GACATCTGGA 810 

GGAAGAAAAA TTGGACAGTT TCAATGTTCC AATCTATGCA GCTTCACTAG AAGAATTAAA GTGCATAGTT GAGGAGGAAG GTTCTTTTGA 900 

AATTTTCTAC TTGGAGACTT TTAAGCTCCG TTATGATGCT GGCTTCTCTA TTGATGATGA TTGCCAAGTA AGATCCCATT CCCCAGAATA 990 

CAGCGATGAA CATGCTACAG CAGCGCATGT GGCATCATTA CTTAGATCAG TTTACGAACC CATCCTCGCA AATCATTTTG GAGAAGCTAT 1080 

TATACCTGAC ATATTCCACA GGTTTGCGAC GAATGCAGCA AAGGTTATCC GCTTGGGCAA AGGCTTCTAT AATAATCTTA TCATTTCTCT 1170 

TGCCAAAAAA CCAGAGAACT CAGACATATA AAAGCTTGTT TATAGTTGGT TTTTCTGCTA TGGTTTCTTT TCTGATACGG GGAAAGGATT 1260 

TACTGCGGTT GGGCTTCAAA AAAAAA A A A A AAAAAAAAAA AAAA 1304 



CTTTGGCACT CCCAATTTCA TTTATGTACA AGTCCTGCaT ATGAATGGAG CTCCAACAAG TCCTGCGGAT GAATGGAGGC GAAGGCGATA 90 

CAAGCTACGC CAAGAATTCA GCCTACAATC AACTGGTTCT CGCCAAGGTG AAACCTGTCC TTGAACAATG CGTACGGGAA TTGTTGCGGG 180 

CCAACTTGCC CAACATCAAC AAGTGCATTA AAGTTGCGGA TTTGGGATGC GCTTCTGGAC CAAACACACT TTTAACACTT CGGGACATTG 179 

■ TCCAAAGTAT TGACAAAGTT GGCCAGGAAA AGAAGAATGA ATTAGAACGT CCCACCATTC AGATTTTTCT GAATGATCTT TTCCCAAATG 360 

ATTTCAATTC GGTTTTCAAG TTGCTGCCAA GCTTCTACCG CAAACTTGAG AAAGAAAATG GACGCAAAAT AGGATCGTGC CTAATAGGGG 4S0 

CAATGCCCGG CTCTTTCTAC AGCAGACTCT TCCCCGAGGA GTCCATGCAT TTTTTACACT CTTCTTACTG TCTTCAATGG TTATCTCAGG 540 

TTCCTAGCGG TTTGCTGACT GAATTGGGGA TCAGTACGAA CAAAGGGAGC ATTTACTCTT CCAAAGCAAfi TCGTCTGCCC CTCCAGAAGG 630 

CATATTTGGA TCAATTTACG AAAGATTTTA CCACATTTCT AAGGATTCAT TCGGAAGAGT TCTTTTCACA TG6CCGAATG CTCCTTACTT 720 

CCATTTGTAA AGGACTTGAA TTAGACGCCC GGAATGCCAT AGACTTACTT GACATGGCAA TAAACGACTT GGTTGTTCAG GGACATCTGG 810 

AGGAAGAAAA ATTGGATAGT TTCAATCTTC CACTCTATAT ACCTTCAGCA GAAGAACTAA ACTGCATAG7 TGAGGAGGAA GGTTCTTTTG . 900 

AAATTTTATA CCTGGAGACT TTTAAGGTCC TTTACGATGC TGGCTTCTCT ATT6ACGATG AACATATTAA AGCAGAGTAT GTTGCATCTT 990 

CCCTTAGAGC AGTTTACGAA CCCATCCTCG CAAGTCATTT TGGAGAAGCT ATTATACCTG ACATATTCCA CAGGTTTGCG AAGCATGCAG 10S0 

CAAAGGTTCT CCCCTTGGGC AAAGGCTTCT ATAATAATCT TATCATTTCT CTCGCCAAAA AGCCAGAGAA GTCAGACCTG TAAAAGTTTG U70 

TTTTTGTGTT GGGGAAAGGA ATAACTGCCG TTGGGGCTCT TTCG6GTATT CTGCTTTTTA TATTATATTG TTTTGTA7CC GTAATAAAAG 1260 

TGGTCTCTAA GAATAAGATA TTTGACATAT ATTATTTTCA AA A AAAAAAA AAAAAA 1316 



D 

AGCAGTCCCA ATTCGATTGT CCTCCATATG AATGGAGCTC CAAGAACTCC 
GAATGCATCC TACAATCTGG CTCTTGCCAA GCTGAAACCT TTCCTTGAAC 
CAACAAGTGC ATTAAAGTTG CGGATTTGGG ATGCGCTTCT GGACCAAACA 
ACTTGGCCAG GAAGAGAAGA ATGAATTAGA ACCTCCCACC ATTCAGATTT 
CAACTTGCTG CCAAGCTTCT ACCGCAAACT CGAGAAAGAA AATGGACGCA 
CTACGGCAGA CTCTTCCCCG AGGACTCCAT GCATTT7TT5 CACTCTTGTT 
GATTGAATTG GGGATTGCTG CAAACAAAGC GAGTAITTAC TCTTCCAAAG 
TACGAAAGAT TTTACCACAT TTCTAAG6AT TCATTCGAAA GAGTTGTTTT 
TGAATTCGAC GAACCGAATC CCCTAGACTT ACTTGACATG GCAATAAACG 
TAGTTTCAAT ATTCCATTCT TTACACCTTC AGCACAAGAA ctaaactgca 
GACTTTTAAG GCCCATTATG ATGCTGCCTT CTCTATTGAT GATGATTACC 
ATCATTAATT A6ATCAGTTT ACGAACCCAT CCTCGCAAGT CATTTTGGAG 
TGCAGCAAAG GTTCTCCACA TGGGCAAAGG CTGCTATAAT AATCTTATCA 
G1 H G I M II AGTTGGTTTT TGTGCCCTTG GGGCTCTTTC G6CTATTGTC 
ATTTAGTACA ATATTTTCAT AAAAAAAAAA AAAAAAAA 



TGCATATGAA TGAAGGTGAA GGCGATACAA GCTACGCCAA 90 

AATGCATAC2 AGAATTCT7G CGGGCCAACT TGCCCAACAT 180 

CACTTTTAAC AGTGCGGGAC ATTGTGCAAA GTATTGACAA 270 

TTCTGAATGA TCTTTTCCAA AATGATTTCA ATTCGGTTTT 360 

AGATAGGATC GTGCCTAATA AGCGCAATGC CTGGCTCTTT 450 

ACAGTGTTCA TT6CTTATCT CAGCTTCCCA CCGGTTTGGT 540 

CATCTCCTCC GCCCGTCCAG AAGGCATATT TGGATCAATT 630 

CACGTGGCCG AATGCTCCTT ACCTGCATTT GTAAAGTAGA 720 

ACTTGATTGT TGAGGGACTT CTGGAGGAAG AAAAATTG6A 810 

TAGTTGAGGA GGAAGGTTCT TGCGAAATTT TATATCTGGA 900 

CACTAAGATC . CCATGAACAA ATTAAAGCAG ACTATGTGCC 990 

AAGCTATTAT GCCTGACTTA TTCCACAGGC TTGCGAAGCA 16W 

TTTCTCTCGC CAAAAAGCCA GAGAAGTCAG ACCTGTAAAA 1170 

CTTTTGTATT CGTAATAAAA CTGATGTGCA AGAATAAGAT 1260 

1298 
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[Identification of Document] Specification 

[Title of the Invention] THEOBROMINE SYNTHASE POLYPEPTIDE OF COFFEE 
PLANT AND THE GENE ENCODING SAID POLYPEPTIDE 

[Claims] 

^[Claim 1] A polypeptide consisting of an amino acid sequence of following (a) 
^or(b): 

(a) an amino acid sequence defined by amino acid numbers from 1 to 378 shown in 
SEQ ID NO: 1 in a Sequence List, 

(b) an amino acid sequence in which a part of said amino acid sequence (a) is deleted, 
substituted or added with another amino acid sequence, the amino acid sequence (b) 
having the activity to biosynthesize theobromine using 7-methylxanthine as the 
substrate. 

[Claim 2] A gene encoding the polypeptide according to Claims 1. 
[Claim 3] A gene consisting of a base sequence of following (c) or (d): 

(c) a base sequence defined by base numbers from 1 to 1298 shown in SEQ ID NO: 2 
in a Sequence List, 

(d) a base sequence in which a part of base sequence (c) is deleted, substituted or 
added with another base sequence, the base sequence (d) encoding a polypeptide 
having the activity to biosynthesize theobromine using 7-methylxanthine as the 
substrate. 

[Claim 4] A transformed plant wherein expression of the gene according to Claim 
2 or 3 is decreased in the plant to inhibit biosynthsis of theobromine. 
[Claim 5] A seed obtained from the transformed plant according to Claim 4. 
[Claim 6] A transformed plant wherein gene according to Claim 2 or 3 is 
introduced to increase biosynthsis of theobromine. 

[Claim 7] A seed obtained from the transformed plant according to Claim 6. 
[Claim 8] A method for production of a transformed plant in which biosynthesis of 
theobromine is decreased by inhibiting expression of the gene according to Claim 2 
or 3. 

[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] 

This invention relates to theobromine synthase polypeptide and the gene 



encoding said enzyme. 

[0002] 

[Prior Art] 

Coffee is a drink consumed all over the world with favorite and its utility 
is markedly large. On the other hand, it is known that excessive ingestion of caffeine, 
which is contained in coffee, causes harmful effects. Caffeine is one of xanthine 
derivatives and theophylline and theobromine are also the members of the xanthine 
derivatives. These xanthine derivatives are known to inhibit phosphodiesterase, 
thereby the amount of cAMP is increased. As the result, xanthine derivatives exhibit 
excitatory effect on the central nerves system and enhance function of the circulatory 
system. When they are ingested at a suitable amount, such effects of xanthine 
derivatives are useful for spiritual elevation. However, when the amount of digestion 
is excessive, they would cause harmful effects as mentioned above. Therefore, there 
has been a strong demand on production of a caffeine-less coffee all over the world. 
[0003] 

[Problems to be Solved by the Invention] 

To obtain caffeine-less coffee, attempts to obtain a gene involved in 
biosynthesis of xanthine derivatives have been performed, in the purpose to achieve 
artificial control of biosynthesis of caffeine. In Fig. 1 (cited from Advances in 
Botanical Research, Vol. 30, Academic Press (1999) pl49), the pathway working for 
caffeine biosynthesis in coffee plants is shown. In Fig. 1, the arrow with solid line 
indicates the main pathway of caffeine synthesis and the arrow with dotted line 
indicates the minor pathway of caffeine synthesis, respectively. As shown in the 
second line of Fig. 1, the pathway operating for biosynthesis of caffeine from 
xanthosine via 7-methylxanthine and theobromine has been known, which is the 
main pathway for biosynthesis of caffeine biosynthesis in coffee plants. The latter 
half of the main biosynthesis pathway of caffeine is composed of three steps of N- 
methylation reactions. These N-methylation reactions have been known to be 
dependent on S-adenosylmethoinine. There also exists a pathway (third line in 
Fig. 1) in which caffeine is biosynthesized from 7-methylxanthine via para-xanthine, 
but it is known that contribution of this pathway is not significant. With regard to 
the first methylation reaction to synthesize 7-methylxanthine, a gene encoding an 
enzyme responsible for said reaction has been obtained and it has been already 



reported (International Laid-Open Publication No. WO 97/35960). However, genes 
involved in the second step methylation reaction and the third step methylation 
reaction have not been known yet. For effective and accurate manipulation of 
caffeine biosynthesis, more knowledge on genes that encode enzymes involved in 
caffeine biosynthesis should be obtained. 
[0004] 

[Solution for Problems] 

The present inventors remarked an enzyme participating to the second 
methylation step reaction and responsible for biosynthesis of theobromine, and they 
have obtained the gene encoding the enzyme. The enzyme is an enzyme operating 
to catalyze biosynthesis of theobromine from 7-methylxanthine. Therefore, when 
expression of the gene encoding said enzyme in inhibited, it would result in decrease 
of theobromine biosynthesis. In the pathway of caffeine biosynthesis, caffeine is 
synthesized through N-methylation of theobromine. Then when biosynthesis of 
theobromine is inhibited, biosynthesis of caffeine would be inhibited as well. 
As described above, theobromine and caffeine exhibit similar pharmacological effect 
as xanthine derivatives. Therefore, isolation of a gene encoding an enzyme, which 
enables concurrent manipulation of theobromine biosynthesis and caffeine 
biosyntheses, has a great significance. That is, if a gene encoding an enzyme 
responsible for the final step of caffeine biosynthesis, i.e. the third methylation step 
is obtained and expression of the gene is inhibited, biosynthesis of caffeine would be 
reduced, but biosynthesis of theobromine would not be reduced. As a result, 
accumulation of theobromine is expected to occur, as the metabolism of theobromine 
is inhibited. Thus, considering that pharmacological effect of theobromine is 
similar to that of caffeine, the effect of the present invention, which relates to 
isolation of a gene encoding theobromine synthase, can be estimated to be significant. 
[0005] 

[Mode for carrying out the Invention] 

The present invention relates to theobromine synthase gene derived from 
Coffea arabica, consisting of a base sequence defined by the base numbers 1 to 1298 
shown in SEQ.ID. NO:2 in a Sequence List. As described above, in coffee plants, 
theobromine synthase catalyzes methylation reaction at biosynthesis of theobromine 
using 7-methylxanthine as the substrate. The gene defined by the base sequence 



described in SEQ.ID. N0:2 in a Sequence List is a gene encoding such theobromine 

synthase having. 

[0006] 

According to technique of gene recombination, artificial modification can 
be achieved at a specific site of basic DNA, without alteration or with improvement 
of basic characteristic of said DNA. Concerning a gene having native sequence 
provided according to this invention or modified sequence different from said native 
sequence, it is also possible to perform artificial modification such as insertion, 
deletion or substitution to obtain gene of equivalent or improved characteristic com- 
pared with said native gene. Moreover, a gene with such mutation is also included 
in the range of this invention. That is, the gene, consisting of a base sequence in 
which apart of base sequence shown in SEQ ID NO: 2 is deleted, substituted or added 
with another base sequence, means a gene in which 10 or less, preferably 7 or less, and 
more preferably 3 or less bases of the sequence is deleted, substituted or added to the 
base sequence shown in SEQ ID NO: 2 in a Sequence List. Moreover, such gene 
exhibits homology 90% or more, preferably 95% or more and still preferably 99% or 
more with the base sequence shown in SEQ ID NO: 2 in a Sequence List. In addition, 
such gene hybridizes with the base sequence shown in the SEQ ID NO: 2 in a Sequence 
List under stringent condition. Such gene is also within the range of this invention 
so far as it encodes a polypeptide having the characteristic as theobromine synthase, 
that catalyzes biosynthesis of theobromine using 7-methylxanthihe as the substrate. 
[0007] 

Furthermore, this invention relates to polypeptide of theobromine 
synthase derived Coffea arabica, consisting of an amino acid sequence defined by the 
amino acid numbers from 1 to 378 shown in SEQ ID NO: 1 in a Sequence List. 
The polypeptide consisting of an amino acid sequence in which a part of said 
polypeptide defined by amino acid sequence shown in SEQ ID NO: 1 is deleted, 
substituted or added with another amino acid sequence means a polypeptide in which 
10 or less, preferably 7 or less, and more preferably 3 or less amino acids of the 
sequence is deleted, substituted or added to the amino acid sequence shown in SEQ 
ID NO: 1 in a Sequence List. Moreover, such polypeptide exhibits homology 90% 
or more, preferably 95% or more and still preferably 99% or more with the amino 
acid sequence shown in SEQ ID NO: 1 in a Sequence List. Such polypeptide is also 



within the range of this invention so far as it exhibits characteristic as theobromine 
synthase, that catalyzes biosynthesis of theobromine using 7-methylxanthine as the 
substrate. Incidentally, the polypeptides shown in SEQ.ID. NO:3, SEQ.ID. NO:5 
and SEQ.ID. NO:7 in a Sequence List can be obtained from Coffea arabica, and the 
polypeptides have higher than 80% of homology compared with the amino acid 
sequence of SEQ.ID. NO:l in a Sequence List. These three polypeptides did not 
exhibit activity as theobromine synthase, despite of high homology to SEQ.ID. NO:l 
in a Sequence List. 
[0008] 

A transformed plant, in which expression of theobromine synthetase gene 
described in SEQ.ID. NO:2 in a Sequence List is inhibited to decrease biosynthesis 
of theobromine, is also within the scope of the present invention. The theobromine 
synthase gene of the present invention is, as mentioned above, a gene encoding an 
enzyme involved in biosynthesis of theobromine in coffea arabica. Thus, by 
inhibiting expression of the gene according to the present invention, biosynthesis of 
theobromine is assumed to decrease in a plant, whereby it enables decrease of 
theobromine content and caffeine content in the plant. As a plant of the target in 
which expression of theobromine synthase gene of the present invention is inhibited, 
coffee plants such as Coffea arabica, Coffea canephora, Coffea liberica and Coffea 
dewevrei and the like can be exemplified. 
[0009] 

In these plants, by inhibiting expression of the gene of the present 
invention, biosyntheses of theobromine and caffeine would be reduced. As a means 
for inhibiting expression of the gene, a method utilizing an antisense gene of the gene 
according to present invention can be adopted. The antisense gene means a gene 
that expresses a base sequence complementary to mRNA, a transcription product of 
DNA constituting a certain gene. The transcription product of the antisense gene is 
complementary to an inherent mRNA, then the antisense gene can inhibit gene 
expression at the stage of translation. By utilizing this technique, expression of 
theobromine synthase gene can be inhibited. 
[0010] 

In addition, other methods that can inhibit expression of a gene have been 
known. By destruction of a targeted gene, expression of the gene can be inhibited. 



Moreover, in a plant, technique of co-suppression (transwitch technique) has been 
known. According to the technique, expression of the targeted gene can be 
inhibited by phenomenon of gene interference, even when sense gene is introduced 
and over-expressed. Moreover, it has been reported in recent years that Double- 
stranded RNA interference (RNAi) method using a double stranded RNA is effective 
to inhibit expression of a gene (Chiou-Fen Chuang et al. PNAS (2000) vol. 97, 4985- 
4990). It has been demonstrated that a double strand RNA can inhibit expression of 
a gene in a sequence specific manner, according to the research mainly utilizing 
nematodes (C.elegans) or fruit fly. In the RNAi method, such double strand RNA is 
utilized and it has been recently demonstrated that the method is effective for not 
only nematodes or fruit fly but also for plants such as Arabidopsis thaliana Heynh. 
The mechanism involved in inhibition of gene expression by the RNAi method is not 
known yet. However, this method would enable inhibition of expression of a gene, 
with higher efficiency compared with the above-mentioned antisense method. 
[0011] 

By the way, purine alkaloids such as caffeine and theobromine, can 
exhibit effect to avoid insects and the effect is considered to be the existence value of 
purine alkaloids in a plant. Thus, the gene of the present invention can be 
introduced in a plant and biosythesis of theobromine can be increased in the plant, 
whereby the plant body would exhibit insect-avoiding activity. As described above, 
the enzyme of the present invention is responsible for biosynthesis of theobromine 
using 7-methylxantine as the substrate. Therefore, it is assumed that, when the 
above-mentioned gene encoding the 7-methylxanthine synthase (International Laid- 
Open Publication WO 97/35960) and the gene of the present invention are introduced 
into a plant concurrently, the effect would be particularly significant. When the 
activity of 7-methylxanthine synthase is enhanced, the amount of substrate available 
for the enzyme according to the present invention would be increased. As a result, 
accumulation of theobromine, which is the objective product, is expected to occur. 
[0012] 

As a method to produce a transformant, a method generally well known 
in this art can be adopted. A vector available for the present invention may include 
plasmid vectors, for example pBI121 can be exemplified, but the scope the vector is 
not to be limited to them. Such vector can be introduced into, for example, 



Agrobacterium. Then the bacteria can be utilized for infection of callus or plantlets, 
resulting in production of transformed plants. Furthermore, it is possible to obtain 
seeds derived from such transformed plants. In Japanese Laid-Open Patent 
Application No. 2000-245485, the present inventors have reported a method 
comprising infection of an embryogenic callus of a coffee plant by Agrobacterium 
tumefaciens EHA101 and the method enables transformation of coffee plants with 
high efficacy. The method for transformation described in Japanese Laid-Open 
Patent Application No. 2000-245485 is assumed to be particularly useful. 
[0013] 

[EXAMPLES] 
(Amplification by PCR) 

A pair of degenerate oligonucleotide (Forward primer, 
GGITGYDSIDSIGGICCIAAYAC; Reverse primer, 

ARIYKIYYRTRRAAISWICCIGG) was synthesized, based on the region conserved 
among TCS1 (Kato et al., 2000, GenBank accession no. AB031280) and two 
proteins (Z99708 and AC008153), with their functions unknown, of Arabidopsis 
thaliana. These oligonucleotides correspond to amino acid sequences of 
GC(A/S)(A/S)GPNT and PGSF(H/Y)(G/K)(R/N)LF, respectively. In a 25 jil of 
reaction mixture containing Coffea arabica cDNA and the above-mentioned primer 
pair, PCR was performed under the conditions described below. That is, after 
reaction at 94°C for one minute, 30 cycles of denaturation at 94°C for 30 seconds, 
annealing at 52°C for 30 seconds and extension at 72°C for one minutes was 
performed, which was followed by a final extension at 72°C for 7 minutes, whereby 
the PCR reaction was completed. The amplified cDNA fragment of about 270 base 
pairs was used for screening of cDNA library. 
[0014] 

(cDNA library construction and screening of the target cDNA) 

Total RNA was extracted from young leaves of coffee (Coffea arabica) 
and it was purified to mRNA by oligo-dT column (Pharmacia). cDNA was 
synthesized from mRNA using ZAPII cDNA synthesis kit (Stratagene), it was 
introduced into XZAPII vector to prepare phage library. Then cDNA library was 
screened using the above-mentioned amplified fragment as a probe. Thirty-five of 
resulting positive plaques were selected randomly and converted to plasmids, then 
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physical mappping and partial sequencing were performed. As a result, they were 

clarified into 4 groups of independent clones. 

[0015] 

Clones #1, #6, #35 and #45 were representatives of each group having 
the longest lengths close to full length cDNAs, and base sequences of the clones 
were determined. Moreover, the deduced amino acid sequences encoded by the 
open reading frame regions of the base sequences were determined. The base 
sequences determined by sequencing were shown in Fig. 2. The base sequence of 
cDNA obtained on the clone #45 was shown in SEQ.ID. NO:2 in a Sequence List 
and in Fig. 2D. The region corresponding to open reading frame of said gene 
ranged from base numbers 32 to 1168, and the deduced amino acid sequence 
encoded by said region was shown in SEQ.ID. NO:l in a Sequence List. Moreover, 
the base sequence of cDNA obtained on the clone #1 was shown in SEQ.ID. NO:4 in 
a Sequence List and in Fig. 2A. The region corresponding to open reading frame of 
said gene ranged from base numbers 14 to 1171, and the deduced amino acid 
sequence encoded by said region was shown in SEQ.ID. NO:3 in a Sequence List. 
Furthermore, the base sequence of cDNA obtained on the clone #6 was shown in 
SEQ.ID. NO:6 in a Sequence List and in Fig. 2B. The region corresponding to 
open reading frame of said gene ranged from base numbers 44 to 1201, and the 
deduced amino acid sequence encoded by said region was shown in SEQ.ID. NO:5 
in a Sequence List. Moreover, the base sequence of cDNA obtained on the clone 
#35 was shown in SEQ.ID. NO:8 in a Sequence List and in Fig. 2C. The region 
corresponding to open reading frame of said gene ranged from base numbers 45 to 
1163, and the deduced amino acid sequence encoded by said region was shown in 
SEQ.ID. NO:7 in a Sequence List. In the following, the gene corresponds the clone 
#45 was designated to MXMT1, the clone #1 was designated to MTL1, the clone # 6 
was designated to MTL2, and the clone #35 was designated to MTL3, respectively. 
[0016] 

The alignment compared among amino acid sequences encoded by 
MXMT1, MTL1, MTL2 and MTL3 was shown in Fig. 3. As a result, it was shown 
that these four sequences exhibit extremely high homology. To confirm the 
functions charge by these polypeptides, genes corresponding to each clone were 
expressed in E. coli to confirm their enzymatic activities. 



[0017] 

(Expression of GST fused protein) 

The open reading frame regions of MTL1 (Clone #1), MTL2 (Clone #6), 
MTL3 (Clone #35) and MXMT1 (Clone #45) were amplified by PCR (polymerase 
chain reaction). Then, they were optionally cloned into pGEX 4T-2 vector 
(Pharmacia) and E. coli (JM109) cells were transformed with the resulting plasmids. 
The obtained E. coli cells were cultured in LB liquid medium containing ampicillin. 
When OD600 of the culture reached to 0.5, IPTG (isopropyl thio-|3-D-galactoside) 
was added to it and the final concentration of IPTG was made to 1 mM, then the 
mixture was further cultured at 16°C for 6 hours. E. coli was desrupted by a 
sonicator and the protein of the purpose was purified by glutathione Sepharose 4B as 
a GST (glutathione S-transferase) fusion protein. Concentration of the protein was 
measured by the Bradford method. Each of the GST fusion protein (500 ng) was 
separated by SDS-PAGE (sodium dodecyl sulfate polyacrylamide gel electro- 
phoresis), then it was stained by CBB (coumasie Brilliant Blue) to confirm 
purification. The purities of the resulting GST fusion proteins were analyzed by 
SDS-PAGE and the results were shown in Fig. 4. In Fig. 4, lane 1 shows the result 
of MTL2 fusion protein, lane 2 shows the result of MTL3 fusion protein, lane 3 
shows the result of MXMT1 fusion protein, respectively. As a result, the resulting 
three fusion proteins were shown to be approximately pure. 
[0018] 

(Measurement of enzymatic activities by thin layer chromatography) 

Measurement of enzymatic activity was performed using thin layer 
chromatography (PLC), based on the method of Kato et al. (Plant Physiol., 1996, 98, 
629-636). In concrete, the reaction mixture of 100 ^1, containing 100 mM Tris-HCl 
(pH 7.5), 200 //M substrate (xanthine, 7-methylxanthine, theobromine, paraxanthine, 
theophylline), 4 fiM 14 C-labeled S-adenosylmethionine, 200 fjM MgCl 2 , 200 ng GST 
fusion protein, was incubated at 27°C for 2 hours. After the reaction, the resulting 
mixture was extracted with 1 ml of chloroform, the chloroform layer was recovered, 
then chloroform was evaporated by speed back concentrator. The residue was 
dissolved in 5 fil of 50% methanol solution, then the solution was developed by TLC 
(solvent for development was water:acetic acid:n-butanol= 2:1:4, v/v/v). After the 
development, signal of radio activity was detected by image analyzer (Fuji BAS 
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2000). The result of enzymatic activity, which was measured on the fusion proteins 
derived from MTL2, MTL3 and MXMT1 using xanthine (X), 7-methylxanthine (7- 
Mx), theobromine (Tb), paraxanthine (Px) and theophylline (Tp) as the substrate, 
was shown in Fig. 5. From Fig. 5, it was revealed that the fusion protein derived 
from MXMT1 exhibited potent activity to synthesize theobromine, using 7- 
methylxanthine as the substrate. The fusion protein derived from MXMT1 also 
exhibited activity to synthesize caffine, using paraxanthine as the substrate, but its 
relative activity was 15% of the above-mentioned activity. On the other hand, the 
fusion proteins derived from MTL2 and MTL3 did not exhibit activity as a methyl 
transferase, using the above-mentioned compounds as the substrate. 
[0019] 

(Enzymatic activity measurement and identification of the product by HPLC) 

Using high performance liquid chromatography (HPLC), enzymatic 
activity of the MXMT1 fusion protein was measured and reaction product obtained 
from the enzymatic reaction was identified. The reaction mixture of 100 fil, 
containing 100 mM Tris-HCl (pH 7.5), 200 /uM of substrate (7-methylxanthine, 
paraxanthine, theobromine), 50 fxM of S-adenosylmethionine, 200 //M of MgCb, 
200 ng of GST fusion protein, was incubated at 27°C for 2 hours. After incubation, 
the mixture was extracted with 1 ml of chloroform, the chloroform layer was 
recovered, then chloroform was evaporated by a speed back concentrator. 
The residue was dissolved in 50 jul of 12% acetonitrile. Then the solution was 
fractionated by HPLC (Shodex Rspak DS-613 column) provided with UV detection 
system. As the solution for development, 12% acetonitrile was used and the signal 
was detected for absorbance of 254 nm. 
[0020] 

The result was shown in Fig. 6. The MXMT1 fusion protein was reacted 
with S-adenosylmethionine and 7-methylxanthine, which is the substrate and the 
reaction product was analyzed by HPLC. The chart exhibiting the result was shown 
in Fig. 6A. Moreover, theobromine was analyzed for a standard compound using 
HPLC and the chart exhibiting the result was shown in Fig. 6B. For preparation of 
negative standard, the MXMT1 fusion protein, S-adenosylmethionine and 7- 
methylxanthine was mixed and the reaction was immediately stopped and the chart 
exhibiting the result was shown in Fig. 6C. For standard products, 7-methylxantine, 
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theobromine, paraxanthine and caffeine were analyzed by HPLC, and the chart 
exhibiting the result was shown in Fig. 6D. Furthermore, S-adenosylmethionine and 
7-methylxanthine was reacted with MXMT1 fusion protein and then theobromine was 
added to the reaction mixture. The chart exhibiting the result was shown in Fig. 6E. 
The peak position of the reaction product detected in Fig. 6A coincided with the 
position of theobromine, which was analyzed as the standard compound. In addition, 
when theobromine was added to the enzymatic reaction mixture, only one peak was 
observed. Therefore, it was shown that theobromine was formed by enzymatic 
reaction of the MXMT1 fusion protein, using 7-methylxantine as the substrate. 
[0021] 

According to the present invention, the polypeptide of theobromine 
synthase derived from coffea arabica and the gene encoding said polypeptide were 
provided. As theobromine synthase participates in biosynthesis of caffeine, 
caffeineless coffee would be obtained by preparing a transformed plant, wherein 
expression of gene encoding said enzyme was inhibited. 
[0022] 

[Sequence Listing] 

<110> Applicant name: President of Nara Institute Science and Technology 

<120> Title of invention: Theobromine synthase polypeptide of coffee plant and the 

gene encoding said polypeptide 

<160> Number of sequences: 8 

<210> Sequence number: 1 

<211> Length of sequence: 378 

<212> Type pf sequence: Amino acid 

<213> Organism: Caffea arabica 

<400> Sequence 

MELQEVLHMN EGEGDTSYAK NASYNLALAK VKPFLEQCIR ELLRANLPNI NKCIKVADLG 60 
CASGPNTLLT VRDIVQSIDK VGQEEKNELE RPTIQIFLND LFQNDFNSVF KLLPSFYRKL 120 
EKENGRKIGS CLISAMPGSF YGRLFPEESM HFLHSCYSVH WLSQVPSGLV IELGIGANKG 180 
SIYSSKGCRP PVQKAYLDQF TKDFTTFLRI HSKELFSRGR MLLTCICKVD EFDEPNPLDL 240 
LDMAINDLIV EGLLEEEKLD SFNIPFFTPS AEEVKCIVEE EGSCEILYLE TFKAHYDAAF 300 
SIDDDYPVRS HEQIKAEYVA SLIRSVYEPI LASHFGEAIM PDLFHRLAKH AAKVLHMGKG 360 
CYNNLIISLA KKPEKSDV 378 
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<210> Sequence number: 2 
<211> Length of sequence: 1298 
<212> Type of sequence: Nucleic acid 
<213> Organism: Caffea arabica 
<400> Sequence 



AGCAGTCGCA ATTCGATTGT CCTGCATATG 


AATGGAGCTC 


CAAGAAGTCC 


TGCATATGAA 


60 


TGAAGGTGAA GGCGATACAA GCTACGCCAA GAATGCATCC 


TACAATCTGG 


CTCTTGCCAA 


120 


GGTGAAACCT TTCCTTGAAC AATGCATACG 


AGAATTGTTG 


CGGGCCAACT 


TGCCCAACAT 


180 


CAACAAGTGC ATTAAAGTTG CGGATTTGGG 


ATGCGCTTCT 


GGACCAAACA 


CACTTTTAAC 


240 


AGTGCGGGAC ATTGTGCAAA GTATTGACAA 


AGTTGGCCAG GAAGAGAAGA ATGAATTAGA 


300 


ACGTCCCACC ATTCAGATTT TTCTGAATGA 


TCTTTTCCAA 


AATGATTTCA ATTCGGTTTT 


360 


CAAGTTGCTG CCAAGCTTCT ACCGCAAACT 


CGAGAAAGAA 


AATGGACGCA 


AGATAGGATC 


420 


GTGCCTAATA AGCGCAATGC CTGGCTCTTT 


CTACGGCAGA 


CTCTTCCCCG 


AGGAGTCCAT 


480 


GCATTTTTTG CACTCTTGTT ACAGTGTTCA 


TTGGTTATCT 


CAGGTTCCCA 


GCGGTTTGGT 


540 


GATTGAATTG GGGATTGGTG CAAACAAAGG 


GAGTATTTAC 


TCTTCCAAAG 


GATGTCGTCC 


600 


GCCCGTCCAG AAGGCATATT TGGATCAATT 


TACGAAAGAT 


TTTACCACAT 


TTCTAAGGAT 


660 


TCATTCGAAA GAGTTGTTTT CACGTGGCCG 


AATGCTCCTT ACCTGCATTT 


GTAAAGTAGA 


720 


TGAATTCGAC GAACCGAATC CCCTAGACTT 


ACTTGACATG 


GCAATAAACG 


ACTTGATTGT 


780 


TGAGGGACTT CTGGAGGAAG AAAAATTGGA 


TAGTTTCAAT 


ATTCCATTCT 


TTACACCTTC 


840 


AGCAGAAGAA GTAAAGTGCA TAGTTGAGGA 


GGAAGGTTCT 


TGCGAAATTT 


TATATCTGGA 


900 


GACTTTTAAG GCCCATTATG ATGCTGCCTT 


CTCTATTGAT 


GATGATTACC 


CAGTAAGATC 


960 


CCATGAACAA ATTAAAGCAG AGTATGTGGC 


ATCATTAATT 


AGATCAGTTT 


ACGAACCCAT 


1020 


CCTCGCAAGT CATTTTGGAG AAGCTATTAT 


GCCTGACTTA 


TTCCACAGGC 


TTGCGAAGCA 


1080 


TGCAGCAAAG GTTCTCCACA TGGGCAAAGG 


CTGCTATAAT 


AATCTTATCA 


TTTCTCTCGC 


1140 


CAAAAAGCCA GAGAAGTCAG ACGTGTAAAA GTTTGTTTTT 


AGTTGGTTTT 


TGTGCCGTTG 


1200 


GGGGTCTTTC' GGGTATTGTC GTTTTGTATT 


CGTAATAAAA GTGATGTGCA AGAATAAGAT 


1260 


ATTTAGTACA ATATTTTCAT AAAAAAAAAA 


AAAAAAAA 






1298 
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<210> Sequence number: 3 
<211> Length of sequence: 385 
<212> Type of sequence: Amino acid 
<213> Organism: Caffea arabica 
<400> Sequence 

MELQEVLHMN GGEGEASYAK NSSFNQLVLA KVKPVLEQCV RELLRANLPN INKCIKVADL 60 
GCASGPNTLL TVWDTVQSID KVKQEMKNEL ERPTIQVFLT DLFQNDFNSV FMLLPSFYRK 120 
LEKENGRKIG SCLIAAMPGS FHGRLFPEES MHFLHSSYSL QFLSQVPSGL VTELGITANK 180 
RSIYSSKASP PPVQKAYLDQ FTKDFTTFLR MRSEELLSRG RMLLTCICKG DECDGPNTMD 240 
LLEMAINDLV AEGRLGEEKL DSFNVPIYTA SVEEVKCMVE EEGSFEILYL QTFKLRYDAG 300 
FSIDDDCQVR SHSPVYSDEH ARAAHVASLI RSVYEPILAS HFGEAIIPDI FHRFATNAAK 360 
VIRLGKGFYN NLIISLAKKP EKSDI 385 
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<210> Sequence number : 4 
<211> Length of sequence: 1360 
<212> Type of sequence: Nucleic acid 
<213> Organism: Caffea arabica 
<400> Sequence 

GTCCTGCATA TGAATGGAGC TCCAAGAAGT CCTGCATATG AATGGAGGCG AAGGCGAAGC 60 
AAGCTACGCC AAGAATTCAT CCTTCAATCA ACTGGTTCTC GCCAAGGTGA AACCTGTCCT 120 
TGAACAATGC GTACGGGAAT TGTTGCGGGC . CAACTTGCCC AACATCAACA AGTGCATTAA 180 
AGTTGCAGAT TTGGGATGCG CTTCCGGACC AAACACACTT TTAACCGTTT GGGACACTGT 240 
ACAAAGTATT GACAAAGTTA AGCAAGAAAT GAAGAATGAA TTAGAACGTC CCACCATTCA 300 
GGTTTTTCTG ACTGATCTTT TCCAAAATGA TTTCAATTCG GTTTTCATGC TGCTGCCAAG 360 
CTTCTACCGC AAACTTGAGA AAGAAAATGG ACG CAAAAT A GGATCGTGCC TAATAGCCGC 420 
AATGCCTGGC TCTTTCCACG GCAGACTCTT CCCCGAGGAG TCCATGCATT TTTTACACTC 480 
TTCTTACAGT CTTCAGTTTT TATCCCAGGT TCCCAGCGGT TTGGTGACTG AATTGGGGAT 540 
CACTGCGAAC AAAAGGAGCA TTTACTCTTC CAAAGCAAGT CCTCCGCCCG TCCAGAAGGC 600 
ATATTTGGAT CAATTTACGA AAGATTTTAC CACATTTTTA AGGATGCGTT CGGAAGAGTT 660 
GCTTTCACGT GGCCGAATGC TCCTTACTTG CATTTGTAAA GGAGATGAAT GCGACGGCCC 720 
GAATACCATG GACTTACTTG AGATGGCAAT AAACGACTTG GTTGCTGAGG GACGTCTGGG 780 
GGAAGAAAAA TTGGACAGTT TCAATGTTCC AATCTATACA GCTTCAGTAG AAGAAGTAAA 840 
GTGCATGGTT GAGGAGGAAG GTTCTTTTGA AATTTTATAC TTGCAGACTT TTAAGCTCCG 900 
TTATGATGCT GGCTTCTCTA TTGATGATGA TTGCCAAGTA AGATCCCATT CCCCAGTATA 960 
CAGCGATGAA CATGCTAGAG CAGCGCATGT GGCATCATTA ATTAGATCAG TTTACGAACC 1020 
CATCCTAGCA AGTCATTTTG GAGAAGCTAT TATACCTGAC ATATTCCACA GGTTTGCGAC 1080 
GAATGCAGCA AAGGTTATCC GCTTGGGCAA AGGCTTCTAT AATAATCTTA TCATTTCTCT 1140 
TGCCAAAAAA CCAGAGAAGT CAGACATATA AAAGCTTGTT TTTAGTTGGT TTTTGTGTTA 1200 
TGGGTTGTTT TCTGATACGG GGAAAGGATT CAGTGCGGTT GGGGTTCTAT CCGAGTATTG 1260 
TACTTTTTAT ATTATTAGTT GGTGTATAAT TATTATGTTA CATTGTTATA TTCGTAATAA 1320 
AAGTGACGTA CAAAAATAAA ATATTTTCAT AAAAAAAAAA 1360 
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<210> Sequence number: 5 
<211> Length of sequence: 385 
<212> Type of sequence: Amino acid 
<213> Organism: Caffea arabica 
<400> Sequence 

V 

\ 

MELQEVLHMN GGEGDASYAK NSSFNQLVLA KVKPVLEQCV GELLRANLPN INKCIKVADL 60 
GCASGPNTLL TVRDIVQSID KVRQEMKNEL ERPTIQVFLT DLFQNDFNSV FMLLPSFYRK 120 
LEKENGRKIG SCLIAAMPGS FHGRLFPEES MHFLHSSYSL QFLSQVPSGL VTELGITANK 180 
RSIYSSKASP PPVQKAYLDQ FTKDFTTFLR IRSEELLSRG RMLLTCICKG DEFDGPNTMD 240 
LLEMAINDLV VEGHLEEEKL DSFNVPIYAA SVEELKCIVE EEGSFEILYL ETFKLRYDAG 300 
FSIDDDCQVR SHSPEYSDEH ARAAHVASLL RSVYEPILAN HFGEAIIPDI FHRFATNAAK 360 
VIRLGKGFYN NLIISLAKKP EKSDI 385 
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<210> Sequence number: 6 
<211> Length of sequence: 1304 
<212> Type of sequence: Nucleic acid 
<213> Organism: Caffea arabica 
<400> Sequence 

TTTAGCAGTC CCAATTCGAT TTATGTACAA GTCCTGCATA TGAATGGAGC TCCAAGAAGT 60 
CCTGCATATG AATGGAGGCG AAGGCGATGC AAGCTACGCC AAGAATTCAT CCTTCAATCA 120 
ACTGGTTCTC GCCAAGGTGA AACCTGTCCT TGAACAATGC GTAGGGGAAT TGTTGCGGGC 180 
CAACTTGCCC AACATCAACA AGTGCATTAA AGTTGCGGAT TTGGGATGCG CTTCCGGACC 240 
AAACACACTT TTAACAGTTC GGGACATTGT ACAAAGTATT GACAAAGTTA GGCAAGAAAT 300 
GAAGAATGAA TTAGAACGTC CCACCATTCA GGTTTTTCTG ACTGATCTTT TCCAAAATGA 360 
TTTCAATTCG GTTTTCATGT TGCTGCCAAG TTTCTACCGC AAACTTGAGA AAGAAAATGG 420 
ACGCAAGATA GGATCGTGCC TAATAGCCGC AATGCCTGGC TCTTTCCACG GCAGACTCTT 480 
CCCCGAGGAG TCAATGCATT TTTTACACTC TTCTTACAGT CTTCAATTTT TATCCCAGGT 540 
TCCCAGCGGT TTGGTGACTG AATTGGGGAT CACTGCGAAC AAAAGGAGCA TTTACTCTTC 600 
CAAAGCAAGT CCTCCGCCCG TCCAGAAGGC ATATTTGGAT CAATTTACGA AAGATTTTAC 660 
CACATTTTTA AGGATTCGTT CGGAAGAGTT GCTTTCACGC GGCCGAATGC TCCTTACTTG 720 
CATTTGCAAA GGAGATGAAT TCGACGGCCC GAATACCATG GACTTACTTG AGATGGCAAT 780 
AAACGACTTG GTTGTTGAGG GACATCTGGA GGAAGAAAAA TTGGACAGTT TCAATGTTCC 840 
AATCTATGCA GCTTCAGTAG AAGAATTAAA GTGCATAGTT GAGGAGGAAG GTTCTTTTGA 900 
AATTTTGTAC TTGGAGACTT TTAAGCTCCG TTATGATGCT GGCTTCTCTA TTGATGATGA 960 
TTGCCAAGTA AGATCCCATT CCCCAGAATA CAGCGATGAA CATGCTAGAG CAGCGCATGT 1020 
GGCATCATTA CTTAGATCAG TTTACGAACC CATCCTCGCA AATCATTTTG GAGAAGCTAT 1080 
TATACCTGAC ATATTCCACA GGTTTGCGAC GAATGCAGCA AAGGTTATCC GCTTGGGCAA 1140 
AGGCTTCTAT AATAATCTTA TCATTTCTCT TGCCAAAAAA CCAGAGAAGT CAGACATATA 1200 
AAAGCTTGTT TATAGTTGGT TTTTGTGCTA TGGTTTGTTT TCTGATACGG GGAAAGGATT 1260 
TAGTGCGGTT GGGGTTCAAA AAAAAAAAAA AAAAAAAAAA AAAA 1304 
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<210> Sequence number: 7 
<211> Length of sequence: 372 
<212> Type of sequence: Amino acid 
<213> Organism: Caffea arabica 
<400> Sequence 

MELQEVLRMN GGEGDTSYAK NSAYNQLVLA KVKPVLEQCV RELLRANLPN INKCIKVADL 60 
GCASGPNTLL TVRDIVQSID KVGQEKKNEL ERPTIQIFLN DLFPNDFNSV FKLLPSFYRK 120 
LEKENGRKIG SCLIGAMPGS FYSRLFPEES MHFLHSCYCL QWLSQVPSGL VTELGISTNK 180 
GSIYSSKASR LPVQKAYLDQ FTKDFTTFLR IHSEELFSHG RMLLTCICKG VELDARNAID 240 
LLEMAINDLV VEGHLEEEKL DSFNLPVYIP SAEEVKCIVE EEGSFEILYL ETFKVLYDAG 300 
FSIDDEHIKA EYVASSVRAV YEPILASHFG EAIIPDIFHR FAKHAAKVLP LGKGFYNNLI 360 
ISLAKKPEKS DV 372 
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<210> Sequence number: 8 
<211> Length of sequence: 1316 
<212> Type of sequence: Nucleic acid 
<213> Organism: Caffea arabica 
<400> Sequence 



CTTTGGCAGT 



CCCAATTTGA 



TTTATGTACA 



AGTCCTGCAT ATGAATGGAG 



CTCCAAGAAG 



60 



TCCTGCGGAT GAATGGAGGC GAAGGCGATA CAAGCTACGC CAAGAATTCA GCCTACAATC 120 
AACTGGTTCT CGCCAAGGTG AAACCTGTCC TTGAACAATG CGTACGGGAA TTGTTGCGGG 180 
CCAACTTGCC CAACATCAAC AAGTGCATTA AAGTTGCGGA TTTGGGATGC GCTTCTGGAC 240 
CAAACACACT TTTAACAGTT CGGGACATTG TCCAAAGTAT TGACAAAGTT GGCCAGGAAA 300 
AGAAGAATGA ATTAGAACGT CCCACCATTC AGATTTTTCT GAATGATCTT TTCCCAAATG 360 
ATTTCAATTC GGTTTTCAAG TTGCTGCCAA GCTTCTACCG CAAACTTGAG AAAGAAAATG 420 
GACGCAAAAT AGGATCGTGC CTAATAGGGG CAATGCCCGG CTCTTTCTAC AGCAGACTCT 480 
TCCCCGAGGA GTCCATGCAT TTTTTACACT CTTGTTACTG TCTTCAATGG TTATCTCAGG 540 
TTCCTAGCGG TTTGGTGACT GAATTGGGGA TCAGTACGAA CAAAGGGAGC ATTTACTCTT 600 
CCAAAGCAAG TCGTCTGCCC GTCCAGAAGG CATATTTGGA TCAATTTACG AAAGATTTTA 660 
CCACATTTCT AAGGATTCAT TCGGAAGAGT TGTTTTCACA TGGCCGAATG CTCCTTACTT 720 
GCATTTGTAA AGGAGTTGAA TTAGACGCCC . GGAATGCCAT AGACTTACTT GAGATGGCAA 780 
TAAACGACTT GGTTGTTGAG GGACATCTGG AGGAAGAAAA ATTGGATAGT TTCAATCTTC 840 
CAGTCTATAT ACCTTCAGCA GAAGAAGTAA AGTGCATAGT TGAGGAGGAA GGTTCTTTTG 900 
AAATTTTATA CCTGGAGACT TTTAAGGTCC TTTACGATGC TGGCTTCTCT ATTGACGATG 960 
AACATATTAA AGCAGAGTAT GTTGCATCTT CCGTTAGAGC AGTTTACGAA CCCATCCTCG 1020 
CAAGTCATTT TGGAGAAGCT ATTATACCTG ACATATTCCA CAGGTTTGCG AAGCATGCAG 1080 
CAAAGGTTCT CCCCTTGGGC AAAGGCTTCT ATAATAATCT TATCATTTCT CTCGCCAAAA 1140 
AGCCAGAGAA GTCAGACGTG TAAAAGTTTG TTTTTGTGTT GGGGAAAGGA ATAAGTGCCG 1200 
TTGGGGGTCT TTCGGGTATT GTGCTTTTTA TATTATATTG TTTTGTATCC GTAATAAAAG 1260 
TGGTGTGTAA GAATAAGATA TTTGACATAT ATTATTTTCA AAAAAAAAAA AAAAAA 1316 
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[Brief description of the drawings] 
[Figl] 

Fig. 1 is a drawing showing the pathway of caffeine biosynthesis. 

[Fig 2] 

Fig. 2 is a drawing showing base sequences of cDNAs obtained from 
MTL1, MTL2, MTL3 and MXMT1. 
[Fig 3] 

Fig. 3 is a drawing showing alignment of amino acid sequences obtained 
from MXMT1, MTL2 and MTL3. 
[Fig 4] 

Fig. 4 is a photograph showing the results of SDS-PAGE analyses 
performed on fusion proteins obtained from MTL2, MTL3 and MXMT1. 
[Fig 5] 

Fig. 5 is a photograph showing the results of TLC to analyze enzymatic 
activities of the fusion proteins obtained from MTL2, MTL3 and MXMT1. 
[Fig 6] 

Fig. 6 is a chart showing the results of HPLC performed to identify 
reaction products in the enzymatic reaction mixture of the fusion protein obtained 
'from MXMT1 identified by HPLC. 
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gtcctgcata 
actggttctc 
acttccacat 

GAAGAATCAA 
CTTCTACCGC 
CCCCGAGCAC 
CACTGCGAAC 

cacattttta 
gaataccatc 
aatctataca 
ttatgatcct 
ggcatcatta 
gaatgcagca 

AAAGCTTCn 
TACTTTTTAT 



TGAATGGAGC 
GCCAAGGTGA 
TTGGGATGCG 
TTACAACCTC 
AAACTTGAGA 
TCaTGCATT 
AAAAGGAGCA 
AGGATGCGTT 
GACTTACTTG 
GCTTCAGTAG 
GGCTTCTCTA 
ATTAGATCAG 
AAGGTTATCC 
TTTAGTTGGT 
ATTATTAGTT 



TCCAAGAAGT 
AACQGTCCT 
CTKCGGACC 
CCACCATTCA 
AAGAAAATGG 
TTTTACACTC 
TTTACTC7TC 
CGGAAGAGTT 
AGATGGCAAT 
AAGAAGTAAA 
TTGATGATGA 
rTTACGAACC 
GCTTGGGCAA 
TTTTGTGTTA 
GGTGTATAAT 



CCTGCATATG 
TGAACAATGC 
AAACACACTT 
GGTTTTTCTC 
ACGCAAAATA 
TTCTTACAGT 
CAAAGCAAGT 
GOTTCACGT 
AAACGACTT6 
GTGCATGGTT 
TTGCCAAGTA 
CATCCTAGCA 
AGGCTTCTAT 
T6GGTTGTTT 
TATTArGTTA 



aatggaggcg 
ctacgggaat 
ttaaccgttt 
actgatcttt 
ggatcgtgcc 
cttcagtttt 
cctccgcccg 
ggccgaatgc 
gttgctgagg 
gaggaggaag 
agatcccatt 
agtcattttg 
aataatctta 
tct6atacgg 
cattgttata 



AAGGCCAAGC 
TGTTGCGGGC 
GGGACACTGT 
TCCAAAATGA 
TAATAGCCGC 
TATCCCAGGT 
TCCAGAAGGC 
TCCTTACTTG 
GAC CTCTG GG 
6TTCTTTTCA 
CCCCAGTATA 
GAGAAGCTAT 
TCATTTCTCT 
GGAAAGGATT 
TTCCTAATAA 



AAGCTACGCC 
CAACT7GCCC 
ACAAAGTATT 
TTTCAATTCG 
AATGCCTGCC 
TCCCAGCGGT 
ATATTTGGAr 

catttgtaaa 
ggaagaaaaa 

AAmTATAC 
CAGCGATGAA 
TATACCTGAC 
TGCCAAAAAA 
CAGTGCGGTT 
AAGTGACGTA 



AAGAATTCAT CCTTCAATCA 
AACATCAACA AGTGCATTAA 
GACAAA GTTA AGCAAGAAAr 
GTTTTCATGC TGCTGCCAAG 

Ten tccacg gcagactctt 

TTGGrGACTG AATTGGGGAT 
CAATTTACGA AAGATTTTAC 
GGAGATGAAT GCGACGGCCC 
TTGGACAGTT TCAATGTTCC 
TTGCACACTT TTAAGCTCCG 
CATGCTAGAC CAGCGCATGT 
ATATTCaCA GGTTTGCGAC 
CCAGAGMGT CAGACATATA 
GGGGTTCTAT CCGAGTATTG 
CAAAAATAAA ATATTTTCAT 



99 
ltd 
270 
360 
450 
540 
630 
720 
810 
900 
990 

im 

1170 
1260 
1350 
1360 



B 



90 
180 
270 
360 



TTTAGCAGTC CCAATTCGAT TTATGTACAA GTCCTGCATA TGAATGCACC TCCAAGAAGT CCTGCATATG AaTGGaGGTG AArj-rrnrrr 
AAGCTACGCC AAGAATTCAT CCTTCAATCA ACTGGTTCTC GCCAAGGTGA AAcItCTCCT TGMCMTGC S tCt££2£ 
CAACTTGCCC AACATCAACA AGTGCATTAA AGTTGCGGAT TTGGGATGCG CTTCCGGACC AAACACACTT TTAACAGTTC 
ACAAAGTATT GACAAAGTTA GGCAAGAAAT GAAGAATCAA TTAGAACGTC CCACCATTCA GGTTTTTCTG ACTGaJSS r£££S£ ^ 
TTTCAATTCG GTTTTCATGT TGCTGCCAAG TTTCTACCGC AAACTTGAGA AAGAAAATGG ACGCAAGATA gStcGTGCC 
AATGCCTGGC TCTTTCCACC GCAGACTCTT CCCCGAGGAG TCAATGCATT TTTTACACTC TTCTTACAGT CTTCMTTTT ifiSSES 22 

tcccagcggt ttcgtgactg aattggggat cactgcgaac aaaaggagca tttactcttc ouagcaact cctccgcS ISSSSSZ IS 

ATATTTGGAT CAATTTACGA AAGATTTTAC CACATTTTTA ACGATTCGTT CGGAAGAGTT GCTTTGKGC WCCCMTK ISfSJSS 2 
CATTTGCAAA GGAGATGAAT TCGACGGCCC GAATACCATG GACTTACTTG AGATGGCAAT SaCGACTtI mSwZ £SJc£5 III 
SJSJJS 2^"£J TCAATGTTCC AATCTATGCA GCTTCAGTAG AAGAATTAAA GTGCATAGTT GAKAGGMG GTTCTTtTS 
AATTTTGTAC TTGGAGAOT TTAAGCTCCG TTATCATCCT GGCTTCTCTA TTGATGATGA TTGCCAAGTA AWCCOTT CCCCAGAATA 

cagcgatca* catgctagag cagcgcatgt ggcatcatta cttagatcag tttacgaacc catcctcgca wcattttg ScugSI? J2 

TATACCTGAC ATATTCCACA GGTTTGCGAC GAATGCAGCA AAGGTTATCC GCTTGGGCAA AGGCTTCTAT AATAATCTTA SrrTCTCT S» 
V^SH^t S*"?* 1 AAAGCTTGTT TATAGTTGGT TTTTGTGCTA TGGTTTGTTT TCTGATACGG GGAMGGATT £2 

TAGTGCCCTT GGGGTTCAAA AAAAAAAAAA AAAAAAAAAA AAAA w*AM*ATT 1Z6B 



CTTTGGCAGT CCCAATTTCA TTTATGTACA AGTCCTGCAT ATCAATGGAG CTCCAACAAG TCCTGCGGAT GAATGGAGGC GAAGGCGATA 40 

CAACaACGC CAAGAATTCA CCCTACAATC AACTGGTTCT CGCCAAGGTG AAACCTGTCC RGAACMTC CCTACGGGAA TTCTTGcS ill 
CCAACTTGCC CAACATCAAC AAGTCCATTA AAGTTGCGGA TTTGGGATGC GCTTaGGAC CAAACACACT TTTAACACTT CGGto£m 

TCCAAAGTAT TGACAAAGTT GGCCAGGAAA AGAAGAATGA ATTAGAACGT CCCACCATTC A6ATTTTTCT GAATCATCTT nSfSZc 52 

ATTTCAATTC GGTTTTCAAG TTGCTGCCAA - GCTTCTACCG CAAACTTCAG A*S£lrG WGCAAAAT AGGATCGTGC Ow££ £ 

CAATGCCCGG CTCTTTCTAC AGCAGACTCT TCCCCGAGGA GTCCATGCAT TTTTTACACT CTtStaCTG TCnCAATGG Sat£££ M 

IEJS2 JJI55S ^ C<!CGA rcwrwCGAA CAMCCGACC atttactctt CCAAAGCAAG TOTCTGCCC gtccagaagg IS 

CATAmGGA TCAATTTACG AAAGATTTTA CCACATTTCT AAGGATTCAT TCGCAAGAGT TGTTTTCACA TGGCCGAATG CTCCTTACTT 720 

gcatttgtaa aggagttgaa ttagacgccc GCAATCcar agacttactt gacatgccaa tamccactt gcttgttgag Ska7££ 
ifi^ ™™ 1AGT nCAATCTrc aCTCTATAT ^cttcacca gmgaagua agtgcatagt tSSSa Sttctttt? 

AAATTTTATA CCTGGAGACT TTTAAGGTCC TTTACGATGC TCGCTTCTCT ATTGACGATG AACATATui AGCAGAGTAT GTTGCATCTT 

SSS £232 SS55S SKSS SSKS SSSSS1 5S5E SSSS 5S 



tie 

900 
990 
1080 
1170 



AGCAGTCGCA ATTCCATTGT CCTGCATATG AATGGAGCTC CAACAACTCC TGCATATGAA TCAAGGTCAA GGCGATACAA GCTACGCCAA 

TACAATGGG CTCTTGCCAA GGTGAAACCT TTCCTTGAAC AATGCATACG AGAATTGTTG CGC^CAACT TGaCAA^If IM 

CAACAAGTGC ATTAAAGTTG CGGATTTGGG ATGCGCTTCT GGACCAAACA CACTTTTAAC AGTGCGGGAC ATTCTGTAii CTATTTirAi \ll 

AGTTGGCCAG GAAGAGAAGA ATGAATTAGA ACGTCCCACC ATTcSatTT TTCTOAtIa TCTTTTCCAA WtStTTCA fTTCamT IS 

CAAGTTGCTG CCAACCTTCT ACCGCAAACT CGAGAAAGAA AATGGACGCA AGATAGGATC GTGCCTAATA AGCGCAATGC CTGGCTCTTT 45ft 

^CCCG AGGAGTCCAT GCATTTTTTG CACTCTTGTT ACAGTCTTCA nGCTTATCT fi^TC S S £ 

CATTGAATTC GGGATTCGTG CAAACAAAGG GAGTATTTAC TCTTCCAAAG GATGTCGTCC GCCCGTCCAC AAGGCATATT TGGATCAATT «0 

TACGAAAGAT TTTACCACAT TTCTAAGGAT TCATTCGAAA GAGTTGTTTT CACGTGGCCG AATCCTCCTT ACCTGCATTT CTMACrtS m 

tcaattcgac waccgaatc ccctagactt acttgacatg gcaataaacg acttgahgt tJgggactt c^gaggaaJ aaa^SJS m 

TAGTTTCAAT ATTCCATTCT TTACACCTTC AGCAGAAGAA CTAAAGTGCA TAGTTGAGGA GGAAGGTTCT TGCGAAATTT TATATCTGGA 900 

OACTTTTAAG GCCCATTATG ATGCiaCTT CTCTATTGAT GATGATTACC CAGTAAGATC CCATGAACAA ATTAAAGCAG AGTATGTGGC 2S 

ATCATTAATT AGATCAGTTT ACGAACCCAT CCTCGCAAGT CATTTTGGAG AAGCTATTAT GCCTGACTTA TKCACAWC ml 

T^GCAAAG GnaCCACA TGGCCAAAGG CTCCTATAAT AATCTTATCA TTTCTCTCGC CAAAMGCCA WGAACTCAG ACCTGTAAAA U70 

GTTTGTTTTT AGTTGGTTTT TGTGCCGTTG GGGCTCTTTC GGGTATTGTC GTTTTGTATT CGTAATAAAA GTGATGTGCA AGAATAAGAT 126fl 

ATTTAGTACA ATATTTTCAT AAAAAAAAAA AAAAAAAA W ' J*JJ 



f 

/ ' 



= 2000P095 



H: 3/ 5 



[03] 
[Fig. 3] 
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[Fig. 6] 
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[Identification of Document] Abstract 

[Abstract] 

[Object] 

The object of this invention is to isolate a gene participating in 
biosynthesis of caffeine for obtain a caffeineless coffee. 
[Solving means] 

According to the present invention, the polypeptide of theobromine 
synthase derived from coffea arabica and the gene encoding said polypeptide are 
provided. As theobromine synthase participates in biosynthesis of caffeine, 
caffeineless coffee would be obtained by preparing a transformed plant, wherein 
expression of gene encoding said enzyme is inhibited. 
[Selected Figure] None 



